Skip to main content



Using machine learning to index text from billions of images

The last year and a half I've led a project on the Dropbox Machine Learning team to take the computer vision/deep learning OCR pipeline I built the year before and automatically run it and several other advanced machine learning models on billions of images daily in Dropbox to extract text for search. This turned out to be one of the largest computational projects Dropbox has ever done. The feature went live yesterday.

Dive into the technical details of how we built this on the Dropbox technical blog.

Latest Posts

Creating a Modern OCR Pipeline Using Computer Vision and Deep Learning

Notes from Neural Network NIPS 2016 conference

Some Surprising Results of What it Would Take to Transport a Million People to Mars

Cloudless: Open Source Deep Learning Pipeline for Orbital Satellite Data

Ten Deep Learning Trends I Saw at NIPS 2015

NIPS Day 3 Posters

NIPS Day 3 Morning Sessions

NIPS Day 1: Tutorials on Scaling Deep Learning, Probabilistic Programming, and Reinforcement Learning

Intelligence Augmentation and the Myth of the “Golden Lost Age”

Personal Photos Model Using Deep Learning