Data Lead - AI Agent Data Pipeline and Validation

wave group United Kingdom
Visa Sponsorship
Apply
AI Summary

Lead the end-to-end data pipeline for unstructured financial documents, directing AI agents to extract and validate data with high ownership. Build and maintain validation systems, work directly with institutional clients, and ensure data quality at row level. Requires 2-5 years of hands-on data product experience with AI-native tooling and Python/SQL fluency.

Key Highlights
Fully hands-on role with no management responsibilities
Directs AI agents for data extraction from raw financial documents
Owns data quality at row level with validation system development
Works directly with institutional clients explaining methodology
Salary up to ~£100k plus profit share and 0.75-1% equity
Located in Old Street with 4-5 office days per week
Key Responsibilities
Build datasets from raw unstructured financial documents
Direct AI agents for data extraction
Interrogate agent output and catch missed data
Encode judgement into validation systems
Work directly with institutional clients explaining methodology
Ensure data quality at row level
Technical Skills Required
Python SQL AI agents Claude Code Cursor OpenAI Agents SDK terminal codebases data environments
Benefits & Perks
Salary up to ~£100k
Profit share
Equity 0.75-1%
VISA sponsorship available
Nice to Have
Experience at financial data provider (Bloomberg, Refinitiv, Preqin, FactSet)
Built agents yourself
Experience with LLMs in production
Web scraping and document parsing at scale
Worked in small team (2-30 people) where owned whole function

Job Description


💻 Job Title: Data Lead (fully hands-on, no management)

💰 Salary: up to ~£100k + profit share

📈 Equity: 0.75 - 1%

📍 Location: Old Street (4-5 office days/week)

🔐 Company: Data processing AI Agents

👥 Team: ~5

💸 Funding: Pre-seed


About the company

This early stage start-up is processing hundreds of thousands of unstructured financial documents into clean, structured datasets for some of the world's largest financial institutions - producing the output of 50, with a team of 5.


Forecasting £1.5m revenue within their first 12 months, they have an immense potential - not based on hype or inflated valuations, but rather achieving mega productivity through intelligent application of AI agents.


Their mid term goal is ~£50m revenue with a sub-30 person team. What's in it for you?

  • Profit share. Cash in your account on regular basis - not a promise of a huge payout IF the company succeeds and sells.


About the role

At most data companies, a dataset is the output of a large analyst team. Here, it's the output of a fleet of AI agents - directed by one person who stakes their reputation on it being right.


That's this role. You're not downstream. You're not cleaning data someone else built. You start from the source - raw documents, filings, internet data - and you build the dataset.


You're the reason institutional clients - banks, hedge funds, investment firms - trust the data.


AI agents do the extraction. You direct them, interrogate the output, catch what they miss and encode your judgement into validation systems that make the whole pipeline better over time.


You'll also work directly with clients, explaining methodology to sophisticated buyers who need to understand what they're relying on.


This isn't QA. It's the highest-ownership, most client-visible position in the company. Your name will be on the data. And for that, you'll have a very senior seat at the table.


✅ Must have requirements:

  • Roughly 2-5 years working directly with data at a company where data is the core product - not as an analyst consuming clean datasets, but building them from messy, unstructured sources
  • Demonstrable obsession with data quality - you know what great data looks like because you've spent time making it from scratch
  • You go to the row level. A missing data point or unexplained anomaly bothers you until it's resolved - not flagged and forgotten
  • Genuinely AI-native: you've been using agentic tooling long enough to have opinions on it - Claude Code, Cursor, OpenAI Agents SDK or equivalent, used on daily basis
  • Python and SQL fluent - you've built and debugged pipelines, not just queried tables
  • Comfortable in the terminal, in codebases and in real-world messy data environments


👍 Bonus points for:

  • Experience at a financial data provider (Bloomberg, Refinitiv, Preqin, FactSet etc.) or in quant/ESG research
  • You've built agents yourself - not just used them
  • Experience with LLMs in production / agentic workflow design
  • Web scraping and document parsing at scale
  • Experience in a small team (2-30 people) where you owned the whole function


🛂 VISA sponsorship is available if needed (but you need to be already living in the UK)


Similar Jobs

Explore other opportunities that match your interests

Research Scientist

Programming
1h ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Not Applicable

necessary ventures

United Kingdom

Senior Technical Leader - Arm Architecture Reference Manual

Programming
9h ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

Arm

United Kingdom

Operations Associate - Multifunctional Role in AI-Powered Investment Research

Programming
21h ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

junior

United Kingdom

Subscribe our newsletter

New Things Will Always Update Regularly