We opened our London office this month to be closer to our experts and customers who are advancing the frontier of AI. We're currently hiring Strategic Project Leads to join our London-based team. Apply at the link in the comments.
Mercor
Software Development
San Francisco, California · 694,862 followers
Defining the future of work
About us
Mercor is defining the future of work. We connect human expertise with leading AI labs and enterprises to train frontier models.
- Website
- mercor.com
- Industry
- Software Development
- Company size
- 51-200 employees
- Headquarters
- San Francisco, California
- Type
- Privately Held
- Founded
- 2023
Locations
- Primary
- San Francisco, California 94105, US
Updates
-
Kimi K2.6 from Moonshot AI scores 27.9% at pass@1 on APEX-Agents-AA from Artificial Analysis. The score is evaluated on 452 of the 480 public tasks from our benchmark for long-horizon professional work in investment banking, management consulting, and corporate law. K2.6 (27.9%) is a substantial improvement over K2.5 (11.5%), putting it within about five points of GPT-5.4 (xhigh) and Claude Opus 4.6 (Max) on professional services work.
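For readers new to the metric, here is a minimal sketch of how a pass@1 mean over benchmark tasks could be computed; the task names and results are hypothetical, not actual APEX-Agents-AA data.

```python
# Minimal sketch of a pass@1 mean over benchmark tasks (hypothetical data,
# not actual APEX-Agents-AA results). With one attempt per task, pass@1 per
# task is simply pass/fail; the benchmark score is the mean across tasks.

def pass_at_1_mean(results: dict[str, bool]) -> float:
    """Fraction of tasks whose single attempt passed."""
    return sum(results.values()) / len(results)

# Three hypothetical long-horizon tasks, one solved:
results = {
    "ib_valuation_memo": True,
    "consulting_market_entry": False,
    "corp_law_diligence": False,
}
print(f"pass@1: {pass_at_1_mean(results):.1%}")  # pass@1: 33.3%
```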
-
Anthropic's Claude Opus 4.7 (Max) is only the second model ever to cross a 50% mean score on APEX-Agents, our benchmark for complex, long-horizon professional work in investment banking, corporate law, and management consulting. GPT-5.4 was first; Opus 4.7 is second. It places 3rd overall on the leaderboard at 33.9% pass@1, and tops the investment banking leaderboard at 37.2%, beating out GPT-5.2 (xhigh). The most interesting finding is that Opus 4.7 thinks harder than its predecessor, and that comes at a token cost: roughly 2x Opus 4.6 at the same effort level. Check out the latest leaderboard at the link in the comments.
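One way to read the two numbers side by side: if mean score averages partial-credit rubric scores while pass@1 counts only fully passed tasks (an assumption on our part, not a detail confirmed in the post), the former can sit well above the latter. A sketch with made-up scores:

```python
# Hypothetical illustration of why a mean rubric score can cross 50% while
# pass@1 stays near 34%. Assumes (not confirmed above) that mean score
# averages partial-credit rubric scores and pass@1 requires a full pass.

rubric_scores = [1.0, 0.8, 0.6, 0.35, 0.3]  # per-task scores in [0, 1], made up

mean_score = sum(rubric_scores) / len(rubric_scores)                    # 0.61
pass_rate = sum(s == 1.0 for s in rubric_scores) / len(rubric_scores)   # 0.20

print(f"mean score: {mean_score:.0%}, pass@1: {pass_rate:.0%}")
# mean score: 61%, pass@1: 20%
```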
-
Ayushi spent years building at the intersection of AI and healthcare, most recently as the founder of a healthcare AI startup. She knew what it felt like to search for product-market fit from the inside, and what it cost when you didn't find it. When she started thinking about what came next, she was deliberate: she wanted colleagues who understood founder life without her having to explain it. About 30% of people at Mercor are former founders. As she puts it: "After years of trying to build something from nothing, there is a specific energy in joining a team that's already sprinting and finding out you can keep pace." At Mercor, she's working on problems that only exist at scale, helping build the infrastructure that connects human expertise to AI advancement. Read Ayushi's story at the link in the comments.
-
We are excited to announce our collaboration with Artificial Analysis on APEX-Agents-AA — an independent, live leaderboard evaluating AI agents on the professional tasks that knowledge workers do every day. The leaderboard is built on APEX-Agents, Mercor's open-source benchmark of 480 tasks across investment banking, management consulting, and corporate law — including tool implementations, rubrics, and grading workflows, all available to the community for evaluation and training. Artificial Analysis runs a subset of these tasks through their open-source Stirrup harness, providing a reproducible, independent baseline that any team can verify and build on.
APEX-Agents-AA results:
🥇 GPT-5.4: 33.3%
🥈 Claude Opus 4.6: 33.0%
🥉 Gemini 3.1 Pro Preview: 32.0%
The top three frontier models are separated by just 1.3 percentage points. The leaderboard will update with key model releases. Check it out at the link in the comments.
-
The privacy and security of our customers and contractors are foundational to everything we do at Mercor. We recently identified that we were one of thousands of companies impacted by a supply chain attack involving LiteLLM. Our security team moved promptly to contain and remediate the incident. We are conducting a thorough investigation supported by leading third-party forensics experts. We will continue to communicate with our customers and contractors directly as appropriate and devote the resources necessary to resolving the matter as soon as possible.
-
Does Training on the APEX-Agents Dev Set Generalize Beyond the Benchmark? Applied Compute post-trained GLM-4.7 on ~2,000 expert Mercor tasks and achieved state-of-the-art legal performance on APEX-Agents. We then evaluated that model, AC-Small, on benchmarks outside its training distribution. On GDPVal, AC-Small's win+tie rate rose from 55.0% to 62.7% (+7.7pp), placing it 5th overall and ahead of Opus 4.5. To understand where the gain came from, we ran two ablations. On Toolathlon, AC-Small improved by roughly 8 points, from 26.5% to 34.6%. On APEX, which removes tool use and agent loops, AC-Small moved up seven spots, beating Opus 4.5, Sonnet 4.5, and Grok 4. The biggest surprise was medicine: AC-Small placed 4th at 64.8%, ahead of GPT-5.4, Gemini 3.1 Pro, and o3, despite zero medical tasks in training. The gains appear to come from stronger procedural discipline: preserving sub-details, checking intermediate outputs, and catching logical errors. Read more at the links in the comments.
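The percentage-point deltas above are easy to sanity-check; a tiny sketch, where the helper name is ours and the before/after scores are the ones quoted in the post:

```python
# Sanity-check of the percentage-point gains quoted above; pp_delta is an
# illustrative helper, and the scores are those reported in the post.

def pp_delta(before: float, after: float) -> float:
    """Improvement in percentage points between two scores."""
    return round(after - before, 1)

print(pp_delta(55.0, 62.7))  # GDPVal win+tie rate: +7.7pp
print(pp_delta(26.5, 34.6))  # Toolathlon: roughly +8pp
```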
-
"The most important problem in the world is what we do all day for work and how the knowledge work economy operates." - Brendan Foody, at Upfront Ventures Summit. Brendan sat down with Sundeep Peechu of Felicis to talk about the future of work, what's blocking enterprise AI, and why humans become more valuable as AI advances. Watch the full video at the link in the comments.
-
Mercor reposted this
Traditional coding benchmarks do not reflect how software is actually built and maintained. That's why we built a new benchmark, APEX-SWE, in partnership with Cognition. It measures whether AI models can perform complex, real-world software engineering work to ship systems that work and debug them when they don't.
APEX-SWE Leaderboard | Pass@1
🥇 OpenAI GPT-5.3 Codex (High): 41.5%
🥈 Anthropic Opus 4.6 (High): 40.5%
🥉 Anthropic Opus 4.5 (High): 38.7%
Every frontier model fails on nearly 60% of real production tasks.
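The "nearly 60%" framing follows directly from the pass@1 scores; a quick check, with model labels copied from the leaderboard above:

```python
# Quick arithmetic behind "every frontier model fails on nearly 60% of real
# production tasks": failure rate = 100% - pass@1, using the scores above.

leaderboard = {
    "GPT-5.3 Codex (High)": 41.5,
    "Opus 4.6 (High)": 40.5,
    "Opus 4.5 (High)": 38.7,
}
for model, p1 in leaderboard.items():
    print(f"{model}: fails {100 - p1:.1f}% of tasks")
# Even the top model fails 58.5% of tasks.
```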