Back in 2016, I came across a prescient guide on data acquisition strategies for AI startups written by Moritz Mueller-Freitag, then co-founder of Twenty Billion Neurons (TwentyBN). We quickly became good friends and explored product applications for TwentyBN’s video understanding technology, which was built on large-scale crowd-acted video demonstrations of concepts, actions and situations that could endow machines with visual common sense and intuitive physics. The company was ultimately acquired by Qualcomm, where Moritz now serves as Director of Product Management.
In the web scraping and data marketplace area, I’d like to suggest Data Boutique as a valid alternative to in-house scraping and to the rising costs of using proxy networks to circumvent anti-bot technologies.
Thanks, this is very helpful!
Question: Any good papers or case studies on LLM-based dataset stitching?
Actually wish I knew more about their exit strategies. They are going to need them.
Thanks for sharing, interesting read!