The most prominent data collection and labeling companies ranked by global prominence, funding activity, and market position. Showing the top 50 out of 561 companies tracked on Fsome.
| # | Company | Location | Employees | Total Funding | Last Round | Last Funded |
|---|---|---|---|---|---|---|
| 1 |
Deepgram
Deepgram provides a voice artificial intelligence platform for speech-to-text, t…
|
San Francisco | — | $215.9M | Series C | 2026-01-13 |
| 2 |
Mercor
Mercor is an AI-based hiring platform that improves the recruitment process by m…
|
San Francisco | — | $483.6M | Series C | 2025-10-27 |
| 3 |
Tanium
Tanium is an IT security firm that provides risk management, incident response, …
|
Kirkland | 1K - 5K | $1.0B | Secondary Market | 2021-06-30 |
| 4 |
Biobeat
Biobeat is a med-tech company that develops remote patient monitoring and data c…
|
Boca Raton | 11 - 50 | $58.0M | Series B | 2025-12-30 |
| 5 |
Scale
Scale AI provides a data-oriented platform that assists in the development of AI…
|
San Francisco | 1K - 5K | $15.9B | Corporate Round | 2025-06-10 |
| 6 |
Sinpex
Sinpex offers an AI-driven platform for automating KYC and KYB compliance checks…
|
Munich | 11 - 50 | $18.8M | Series A | 2026-01-19 |
| 7 |
Defined.ai
Defined.ai (former DefinedCrowd) enabling AI creators of the future.
|
Seattle | 101 - 250 | $78.6M | Series Unknown | 2022-01-19 |
| 8 |
AISquared
SaaS, On-Prem, AI Infrastructure, AI Model Library
|
Mountain View | 51 - 100 | $19.8M | Series A | 2024-04-17 |
| 9 |
Dataloop AI
DataLoop's data management and annotation platform streamlines the process of ge…
|
Herzliya | 51 - 100 | $49.0M | Series B | 2022-11-03 |
| 10 |
Sama
Sama is a training data partner trusted by organizations to develop accurate art…
|
San Francisco | 5K - 10K | $84.8M | Series B | 2021-11-04 |
| 11 |
iMerit
iMerit provides AI data solutions across computer vision and natural language pr…
|
San Jose | 5K - 10K | $36.3M | Series Unknown | 2021-11-01 |
| 12 |
Hive
Hive provides cloud-based AI solutions for understanding, searching, and generat…
|
San Francisco | 251 - 500 | $120.7M | Series D | 2021-04-21 |
| 13 |
Prolific
Building the most advanced global infrastructure for People Science.
|
Oxford | 101 - 250 | $33.5M | Series A | 2023-07-11 |
| 14 |
Handshake
Handshake is a college career network that helps students and recent graduates f…
|
San Francisco | 501 - 1K | $434.0M | Series F | 2022-01-19 |
| 15 |
Mappa.ai
Use Mappa to hire thoroughly vetted Latin American rockstars in just 48 hours.
|
Montevideo | 1 - 10 | $3.4M | Seed | 2025-09-09 |
| 16 |
HumanSignal
HumanSignal offers data labeling software and annotation tools to build accurate…
|
San Francisco | 51 - 100 | $29.0M | Series A | 2022-05-18 |
| 17 |
Clarifai
Clarifai's platform supports the full AI development lifecycle; including datase…
|
Fort Lee | 101 - 250 | $100.0M | Series C | 2021-10-15 |
| 18 |
Toloka
Toloka offers a data-centric environment that supports fast and scalable AI deve…
|
Luzern | 251 - 500 | $72.0M | Series Unknown | 2025-05-06 |
| 19 |
Gretel
Gretel is a multimodal synthetic data platform that leverages advanced generativ…
|
San Diego | 51 - 100 | $67.7M | Series B | 2021-10-07 |
| 20 |
V7
V7 is a powerful AI training data platform used to produce high-quality image an…
|
London | 51 - 100 | $43.0M | Series A | 2022-11-28 |
| 21 |
Funnel
Funnel designs and develops software/platforms that help marketers automate thei…
|
Stockholm | 251 - 500 | $133.8M | Series C | 2021-10-12 |
| 22 |
SuperAnnotate
SuperAnnotate is an AI data platform that unifies AI pipeline and simplifies dat…
|
San Mateo | 101 - 250 | $67.0M | Series B | 2025-07-15 |
| 23 |
FormAssembly
Secure data collection made simple.
|
Bloomington | 101 - 250 | $16.5M | Series Unknown | 2024-03-13 |
| 24 |
Fulcrum
Fulcrum is a SaaS platform allowing manufacturers to improve efficiency through …
|
Minneapolis | 11 - 50 | $39.9M | Series A | 2023-08-09 |
| 25 |
EthonAI
EthonAI is an AI-powered platform that detects and prevents manufacturing defect…
|
Z├╝rich | 11 - 50 | $24.9M | Series A | 2024-05-30 |
| 26 |
AfterQuery
Research lab investigating the boundaries of AI capabilities. Serving every fron…
|
San Francisco | 11 - 50 | $500K | Series A | 2026-04-09 |
| 27 |
Rep Data
Rep Data is a market research company that offers assistance in data collection …
|
New Orleans | 51 - 100 | $11.6M | Private Equity | 2025-03-13 |
| 28 |
Snorkel AI
Snorkel AI is an AI platform that accelerates data labeling by using machine lea…
|
Redwood City | 101 - 250 | $235.2M | Series Unknown | 2025-08-06 |
| 29 |
Centaur Labs
Centaur provides a platform and services for high quality data annotation
|
Boston | 11 - 50 | $35.4M | Series B | 2024-10-08 |
| 30 |
Troveo AI
Troveo AI provides licensed video content to AI developers for model training.
|
Austin | 11 - 50 | $5.5M | Seed | 2024-11-01 |
| 31 |
XOCEAN
XOCEAN delivers ocean data using uncrewed surface vessels for various applicatio…
|
Carlingford | 101 - 250 | $193.1M | Series C | 2025-01-09 |
| 32 |
Cumbuca
Develop your own payments infraestructure in Brazil using our license.
|
São Paulo | 11 - 50 | $3.1M | Seed | 2023-08-24 |
| 33 |
Flikforge
Flikforge is the trust infrastructure for generative AI video, providing data la…
|
Los Angeles | 11 - 50 | $1.4M | Series A | 2025-01-31 |
| 34 |
Browsi
Browsi is a marketing intelligence platform that tracks digital advertising acti…
|
New York | 51 - 100 | $17.0M | Series A | 2020-01-01 |
| 35 |
super.AI
Automate data extraction from complex documents with guaranteed results using su…
|
San Francisco | 11 - 50 | $18.3M | Series A | 2021-06-16 |
| 36 |
omica.ai
Omica turns real-world clinical data into structured training infrastructure for…
|
New York | 1 - 10 | $1.0M | Pre Seed | 2024-02-16 |
| 37 |
Octane AI
Octane AI provides an all-in-one platform for engaging quizzes, data collection,…
|
San Francisco | 11 - 50 | $10.8M | Series Unknown | 2021-07-20 |
| 38 |
Archive
Archive App is a digital asset manager built for modern digital marketers.
|
Miami | 11 - 50 | $8.1M | Seed | 2023-06-12 |
| 39 |
atla
Atla is an AI research and deployment company dedicated to enabling the safe dev…
|
London | 11 - 50 | $5.0M | Seed | 2023-12-07 |
| 40 |
Skyline Robotics
Skyline Robotics is a robotics and automation company that develops a cleaning s…
|
Tel Aviv | 11 - 50 | $22.6M | Corporate Round | 2024-12-20 |
| 41 |
Metapair
Email AI agent for GTM teams
|
San Francisco | 11 - 50 | $6.0M | Seed | 2021-09-12 |
| 42 |
TELUS Digital
TELUS Digital accelerates revenue growth and operational efficiency with AI-fuel…
|
Vancouver | c 10001 max | — | Private Equity | 2016-05-05 |
| 43 |
Docugami
Docugami is a Seattle-area document engineering startup that transforms how busi…
|
Kirkland | 11 - 50 | $11.9M | Seed | 2021-01-01 |
| 44 |
Enabled Intelligence
Enabled Intelligence provides secure and accurate data labeling services.
|
Arlington | 101 - 250 | $1.0M | Seed | 2021-09-13 |
| 45 |
AGEYE
Vertical Farm Ecosystems, Farm Management Software & AI Crop Intelligence
|
Raleigh | 11 - 50 | $1.8M | Seed | 2024-05-08 |
| 46 |
Mingle Health
Mingle Health develops an end-to-end quality-improvement platform that simplifie…
|
Sandy | 11 - 50 | $15.3M | Debt Financing | 2023-03-20 |
| 47 |
Field Agent
Field Agent is a provider of enterprise services that offers mobile research and…
|
Fayetteville | 51 - 100 | $13.2M | Private Equity | 2024-09-23 |
| 48 |
Liva AI
Liva AI provides real voice and video data for AI.
|
San Francisco | 1 - 10 | $3.0M | Seed | 2025-09-09 |
| 49 |
MyAgData
MyAgData is a cloud-based application that automates the data collection and rep…
|
Effingham | 11 - 50 | $5.5M | Debt Financing | 2018-08-23 |
| 50 |
Heex Technologies
Smart Data for AI development.
|
Paris | 11 - 50 | $10.3M | Seed | 2024-01-18 |