Lead Data Scientist, AI Knowledge Retrieval Systems

averity โ€ข United State
Relocation
Apply
AI Summary

Lead the architecture and delivery of next-generation agentic knowledge retrieval and deep research systems for a defense technology company. Define the technical roadmap, design and deploy advanced AI systems for complex data analysis and national security applications. Requires 7+ years of experience in Data Science, expert knowledge of RAG, knowledge graphs, and multi-agent AI.

Key Highlights
Own the architecture and delivery of next-generation agentic knowledge retrieval and deep research systems.
Set the technical direction for AI knowledge retrieval initiatives and a team of data scientists and AI engineers.
Develop production AI systems with direct impact on national security clients.
Key Responsibilities
Architect and lead delivery of a complex, multi-modal intelligent agent system for knowledge retrieval and deep research.
Define the multi-year technical roadmap for AI knowledge retrieval initiatives.
Design and deploy agentic AI systems capable of deep conversational knowledge exploration, robust problem solving, and accurate question answering over large, heterogeneous proprietary datasets.
Lead development of hybrid knowledge retrieval systems combining embedding models, knowledge graphs, and traditional methods using paradigms such as GraphRAG and RAG-RL.
Drive state-of-the-art ranking and reranking pipelines within production search and retrieval systems.
Build and deploy multi-agent AI reasoning frameworks for complex knowledge exploration and decision support.
Collaborate with product, data engineering, and software development teams to translate high-level requirements into production-ready AI solutions.
Mentor and elevate junior and senior engineers.
Champion AI innovation and stay current with the LLM and agentic AI landscape.
Conduct internal workshops and lead technical discussions to nurture a strong AI culture across the organization.
Technical Skills Required
Retrieval-Augmented Generation (RAG) LLMs Multi-modal embedding models Vector databases Knowledge graph systems GraphRAG RAG-RL Bi-encoder Cross-encoder Multi-tower architectures ReAct Chain-of-Thought BM25 AI coding tools
Benefits & Perks
Base Salary: $200,000
20-30% bonus
Equity: $125,000 - $175,000
Unlimited PTO
401(k)
Professional development reimbursement
Relocation assistance

Job Description


We are a fast-growing defense technology company whose AI-powered intelligence platform is deployed across the full spectrum of defense acquisition โ€” supply chain, science & technology, sustainment, production, and modernization. Our software gives the acquisition community the speed and analytical depth to rapidly imagine, produce, and field critical warfighting capabilities, turning data into strategic advantage for national security clients. We have offices in the Washington DC metro area and Pittsburgh, Pennsylvania.

What's The Role?

We are looking for an exceptional Lead Data Scientist to own the architecture and delivery of our next-generation agentic knowledge retrieval and deep research systems. This is a senior individual-contributor leadership role โ€” you will be the primary technical authority on search science, RAG, knowledge graphs, and multi-agent AI while also setting the direction for a team of data scientists and AI engineers. If you thrive on turning ambiguous, hard problems into production AI systems that matter to national security, this role was built for you.


What Youโ€™ll Do

  • Architect and lead delivery of a complex, multi-modal intelligent agent system for knowledge retrieval and deep research โ€” the central nervous system of our AI platform.
  • Define the multi-year technical roadmap for AI knowledge retrieval initiatives, translating vague business challenges into concrete, scalable research and product goals.
  • Design and deploy agentic AI systems capable of deep conversational knowledge exploration, robust problem solving, and accurate question answering over large, heterogeneous proprietary datasets.
  • Lead development of hybrid knowledge retrieval systems combining embedding models, knowledge graphs, and traditional methods (e.g., BM25) using paradigms such as GraphRAG and RAG-RL.
  • Drive state-of-the-art ranking and reranking pipelines โ€” bi-encoder, cross-encoder, multi-tower architectures โ€” within production search and retrieval systems.
  • Build and deploy multi-agent AI reasoning frameworks (ReAct, Chain-of-Thought) for complex knowledge exploration and decision support.
  • Collaborate with product, data engineering, and software development teams to translate high-level requirements into production-ready AI solutions integrated with client data platforms.
  • Mentor and elevate junior and senior engineers, fostering a culture of technical excellence, continuous learning, and high-impact delivery.
  • Champion AI innovation: stay current with the rapidly evolving LLM and agentic AI landscape and actively transfer knowledge across the team.
  • Conduct internal workshops and lead technical discussions to nurture a strong AI culture across the organization.
  • Spend some time on site with clients - approximatley 10% travel - mostly between Pittsburgh and the D.C area.


What Skills Do I Need?

  • 7+ years of professional experience as a Data Scientist in an industry setting.
  • Expert-level knowledge of Retrieval-Augmented Generation (RAG) architectures and complex indexing strategies for large-scale, heterogeneous datasets.
  • Proven ability to build production-ready knowledge retrieval, conversational AI search, and deep research systems leveraging state-of-the-art LLMs, multi-modal embedding models, vector databases, and knowledge graph systems.
  • Deep hands-on experience with hybrid retrieval combining embedding models, knowledge graphs, and traditional methods; practical knowledge of GraphRAG, RAG-RL, and related advanced retrieval paradigms.
  • Expert knowledge of ranking and reranking models (bi-encoder, cross-encoder, multi-tower) in search and knowledge retrieval contexts.
  • Strong proficiency building and deploying multi-agent AI systems and reasoning frameworks for complex knowledge exploration.
  • Experience using AI coding tools (e.g., Claude Code or similar) to accelerate team deliveries at scale.
  • Proven track record owning end-to-end execution of large-scale AI/ML projects from ideation to production.
  • Excellent communication skills: able to translate complex AI concepts for both technical and non-technical audiences.
  • You must be a U.S. Citizenship

Compensation

  • Base Salary based upon experience but expect around $200,000
  • 20-30% bonus - paid quarterly
  • Equity - between $125,000 - $175,000 with 4 year vest
  • Unlimited PTO
  • 401(k) with 4% immediate vesting
  • Professional development reimbursement
  • We can also provide relocation assistant if you need to move to the Pittsburgh or D.C. area.

Why Join Us

This role offers the rare combination of cutting-edge AI research, real-world national security impact, and the autonomy to define and build systems that matter. Youโ€™ll work on hard problems with exceptional colleagues, on a platform that is genuinely changing how the defense acquisition community operates.

We are also targeting an IPO in the near future, so your equity will actually worth something here!


Similar Jobs

Explore other opportunities that match your interests

Senior ServiceNow Business Analyst

Data Science
โ€ข
2h ago
Visa Sponsorship Relocation Remote
Job Type Contract
Experience Level Mid-Senior level

Info Services

United State

Data Analyst II

Data Science
โ€ข
13h ago

Premium Job

Sign up is free! Login or Sign up to view full details.

โ€ขโ€ขโ€ขโ€ขโ€ขโ€ข โ€ขโ€ขโ€ขโ€ขโ€ขโ€ข โ€ขโ€ขโ€ขโ€ขโ€ขโ€ข
Job Type โ€ขโ€ขโ€ขโ€ขโ€ขโ€ข
Experience Level โ€ขโ€ขโ€ขโ€ขโ€ขโ€ข

pacific life

United State

Data and Reporting Analyst

Data Science
โ€ข
14h ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Entry level

Lensa

United State

Subscribe our newsletter

New Things Will Always Update Regularly