Using machine learning to index text from billions of images

For the last year and a half, I've led a project on the Dropbox Machine Learning team to take the computer vision/deep learning OCR pipeline I built the year before and run it automatically, along with several other advanced machine learning models, on billions of images daily in Dropbox to extract text for search. This turned out to be one of the largest computational projects Dropbox has ever done. The feature went live yesterday.

For the technical details of how we built this, see the Dropbox technical blog.

