Our client, PhotoStat, is an online platform for photographers and artists to find out where and how their images are being used online. PhotoStat enables artists to find and resolve cases of unauthorized usage of their images.
PhotoStat has helped more than 100,000 artists and agencies uncover and handle copyright infringement claims worldwide through tech and legal support.
Disclaimer: The name PhotoStat is a placeholder, as there is an NDA signed between both parties.
With the accessibility the internet provides, image theft has increased sharply over the years. Whether out of sheer unawareness or willful malice, many people use photographers' intellectual property without permission or credit.
One such instance happened in 2012 when John (name changed to maintain confidentiality), a photographer, was battling online image theft. That experience led him to build PhotoStat, a platform where artists and photographers upload their portfolios or images and check whether someone has used their work elsewhere on the internet without authorization.
PhotoStat then performs image matching, searching for similar images among millions on the internet. The challenge was that the platform produced a high rate of false positives and false negatives, which confused users and hurt the company's brand image and customer base.
The platform was based on a logistic regression model with a precision of less than 65%. It returned many incorrect results, which the team then had to analyze and classify manually.
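To make the precision figure concrete, here is a minimal sketch of how precision and recall follow from false-positive and false-negative counts. The counts are hypothetical and chosen only to illustrate a precision below 65%; they are not PhotoStat's actual numbers.

```python
def precision_recall(tp, fp, fn):
    """Return (precision, recall) from confusion-matrix counts.

    tp: true matches reported as matches
    fp: non-matches wrongly reported as matches (false positives)
    fn: true matches the system missed (false negatives)
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Hypothetical counts, for illustration only.
p, r = precision_recall(tp=60, fp=40, fn=20)
print(f"precision={p:.2f} recall={r:.2f}")
```

With a precision around this level, roughly four in ten reported matches would need manual review before an artist could act on them.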
To fix this problem, the client started looking for a solution that measures the degree of visual similarity between images instead of simply classifying them as positive or negative. To enable fast searching and indexing in the future, the platform also needed to hash (fingerprint) the images being compared. Additionally, the client wanted the platform to support a range of image file formats, viz. GIF, PNG, JPG, JPEG, TIFF, WEBP, and PDF.
The founders of PhotoStat were looking for companies specializing in Computer Vision services via Clutch and stumbled upon Maruti Techlabs' profile. A couple of calls and meetings later, it was clear that both companies aligned well, and Maruti Techlabs was signed as their Computer Vision Services partner.
These were some of the reasons Maruti Techlabs stood out to the founders:
• Maruti Techlabs' expertise and experience in Image Recognition and Segmentation
• Our work in Object Recognition, OCR, and Process Automation
• Our expertise in refining and sorting different datasets
• The high accuracy of the AI and ML models we have built
• Reviews from our clients on the Maruti Techlabs Clutch profile
1. Feasibility Study:
After thoroughly understanding the client's use case and requirements, we began with a four-week feasibility study, in which our AI experts defined the scope of the solution and conducted a detailed analysis of the platform's current state.
During the feasibility study, our AI experts worked with a sample of the image datasets (provided by the client) to determine the feasibility of the desired outcome. After creating the training dataset from the client's database, we filtered, organized, and labeled it to make it search-friendly. The labeled dataset then underwent meticulous quality checks such as adding or removing pixels, removing noise, and sorting misclassified data.
After the data was processed, our engineers leveled the dataset by applying transformations such as flipping, cropping, blurring, zooming, and compressing as required. In short, our AI experts studied, preprocessed, and matched the sample data, then defined the approaches they would take to build the search engine.
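The transformations mentioned above can be sketched in a few lines. This is only an illustration under stated assumptions: images are represented as nested lists of grayscale values, and the function names (flip_horizontal, crop, box_blur, zoom_2x, augment) are hypothetical. The case study does not say which library or parameters the team actually used, and compression is omitted here.

```python
def flip_horizontal(img):
    """Mirror each row left to right."""
    return [row[::-1] for row in img]


def crop(img, top, left, height, width):
    """Return the sub-image starting at (top, left)."""
    return [row[left:left + width] for row in img[top:top + height]]


def box_blur(img):
    """3x3 mean blur; edge pixels average over the neighbours that exist."""
    h, w = len(img), len(img[0])
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            vals = [img[j][i]
                    for j in range(max(0, y - 1), min(h, y + 2))
                    for i in range(max(0, x - 1), min(w, x + 2))]
            row.append(sum(vals) // len(vals))
        out.append(row)
    return out


def zoom_2x(img):
    """Nearest-neighbour 2x upscale: every pixel becomes a 2x2 block."""
    out = []
    for row in img:
        wide = [p for p in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def augment(img):
    """Produce one variant per transformation for a single source image."""
    h, w = len(img), len(img[0])
    return {
        "flipped": flip_horizontal(img),
        "cropped": crop(img, 0, 0, max(1, h // 2), max(1, w // 2)),
        "blurred": box_blur(img),
        "zoomed": zoom_2x(img),
    }


# Tiny 4x4 grayscale sample, for illustration only.
sample = [[0, 50, 100, 150],
          [10, 60, 110, 160],
          [20, 70, 120, 170],
          [30, 80, 130, 180]]
variants = augment(sample)
```

Generating such variants gives the matcher examples of the edits that commonly appear when an image is reused online.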
2. Development:
Once we completed the feasibility study and refined the training datasets, the actual development work began. Using the training datasets, we designed a computer vision search engine that returns images similar to the given input image (uploaded by the artist).
Along with the similar images, the search engine shows how similar each result is, expressed as a similarity distance. For easy interpretation, we created three buckets into which the search engine classifies results based on their similarity distance score. The three buckets are:
• Exact Match
• Similar Match
• No Match
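The bucketing step can be sketched as a pair of thresholds on the distance score. The sketch assumes the distance is normalized to the range 0 to 1 (0 meaning identical); the threshold values and the names EXACT_MAX, SIMILAR_MAX, similarity_percent, and bucket are hypothetical, since the case study does not state the actual cut-offs.

```python
# Hypothetical thresholds on a distance normalized to [0, 1].
EXACT_MAX = 0.10
SIMILAR_MAX = 0.40


def similarity_percent(distance):
    """Convert a normalized distance into a similarity percentage."""
    d = min(max(distance, 0.0), 1.0)  # clamp to [0, 1]
    return round((1.0 - d) * 100, 1)


def bucket(distance):
    """Assign a result to one of the three buckets by its distance."""
    if distance <= EXACT_MAX:
        return "Exact Match"
    if distance <= SIMILAR_MAX:
        return "Similar Match"
    return "No Match"


for d in (0.05, 0.25, 0.90):
    print(f"distance={d:.2f} similarity={similarity_percent(d)}% -> {bucket(d)}")
```

In practice the cut-offs would be tuned on labeled data so that each bucket matches what a human reviewer would call exact, similar, or unrelated.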
We built the image search engine as a 5-layered architecture wherein:
1. The first layer preprocesses the images for noise removal, normalization, filtration, resizing, grayscale conversion, etc.
2. The second layer acts as the feature extractor and feature hash generator. It extracts the features of the images being compared, and a hash generation algorithm we built then derives a hash from those extracted features.
3. We built an index tree for the third layer to enable faster searching of images based on their feature vectors.
4. The fourth layer buckets incoming images based on the similarity distance calculation. If the initial distance score from feature hash matching is not confident (the distance exceeds a certain threshold), this layer matches the feature vectors directly and calculates the percentage of the user image's features found in the public image, which accommodates picture-in-picture similarity scoring.
5. A fifth layer handles logo detection, using a separate model trained on labeled logo images.
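Layers two to four can be illustrated with a small sketch. It is not the algorithm built for PhotoStat, which the case study does not disclose. As stand-ins, it uses an average hash over a flat list of grayscale values (assumed to be an already resized 8x8 thumbnail), Hamming distance between hashes, a BK-tree as the index tree, and a set-overlap percentage as the fallback for picture-in-picture cases. All names here are hypothetical.

```python
def average_hash(pixels):
    """Stand-in fingerprint: one bit per pixel, set when above the mean."""
    mean = sum(pixels) / len(pixels)
    bits = 0
    for p in pixels:
        bits = (bits << 1) | (1 if p > mean else 0)
    return bits


def hamming(a, b):
    """Number of differing bits between two integer hashes."""
    return bin(a ^ b).count("1")


class BKTree:
    """Index tree keyed on Hamming distance for fast near-duplicate lookup."""

    def __init__(self):
        self.root = None  # node layout: [hash, label, {distance: child}]

    def add(self, h, label):
        if self.root is None:
            self.root = [h, label, {}]
            return
        node = self.root
        while True:
            d = hamming(h, node[0])
            child = node[2].get(d)
            if child is None:
                node[2][d] = [h, label, {}]
                return
            node = child

    def query(self, h, radius):
        """Return sorted (distance, label) pairs within `radius` of h."""
        results = []
        stack = [self.root] if self.root is not None else []
        while stack:
            node_hash, label, children = stack.pop()
            d = hamming(h, node_hash)
            if d <= radius:
                results.append((d, label))
            # Triangle inequality: only these subtrees can hold matches.
            for edge, child in children.items():
                if d - radius <= edge <= d + radius:
                    stack.append(child)
        return sorted(results)


def feature_overlap_percent(user_features, public_features):
    """Fallback score: share of the user image's features found in the
    public image, so a picture embedded in a larger one still scores high."""
    user = set(user_features)
    if not user:
        return 0.0
    return 100.0 * len(user & set(public_features)) / len(user)


# Synthetic 64-value "thumbnails", for illustration only.
base = [10] * 32 + [200] * 32      # dark half followed by bright half
near = list(base)
near[0] = 250                      # one pixel changed
far = [200, 10] * 32               # alternating pattern

tree = BKTree()
tree.add(average_hash(base), "public_a")
tree.add(average_hash(far), "public_b")

matches = tree.query(average_hash(near), radius=5)
overlap = feature_overlap_percent({"a", "b", "c", "d"}, {"a", "b", "c", "x", "y"})
```

The hash lookup is cheap and handles near-duplicates; the overlap score is only consulted when the hash distance is too large to be confident, mirroring the two-stage logic described for the fourth layer.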
We follow Agile, Lean, & DevOps best practices to create a superior prototype that brings your users’ ideas to fruition through collaboration & rapid execution. Our top priority is quick reaction time & accessibility.
We really want to be your extended team, so apart from the regular meetings, you can be sure that each of our team members is one phone call, email, or message away.