Senior Data Scientist

analytica United State
Remote
Apply
AI Summary

Analytica is seeking a Data Scientist to support long-term federal client engagements projects in the DC Metro area. The role will apply statistical programming, modeling, visualization techniques, data mining, and forecasting skills to analyze challenging public sector problems. The ideal candidate will have a Master's degree in Statistics, Mathematics, Computer Science, or similar, with experience utilizing SAS, R, or Python to support NLP use cases.

Key Highlights
Data Scientist role
Federal client engagements
NLP and machine learning expertise
Key Responsibilities
Collect, clean, and prepare data sets for input into a computational model using Python
Demonstrate experience with NLP feature engineering methods
Select classification modeling techniques to fit the business problem
Investigate, report, and justify model results
Present the results of modeling activities and explain the relevance of results to the organization's business challenges
Technical Skills Required
Python NLP Machine Learning
Benefits & Perks
Competitive compensation
Employer paid health care
Training and development funds
401k match
Nice to Have
Experience with GenAI and Prompt Engineering
Experience in Databricks and MLFlow
Experience with machine translation and transcription of foreign language documents using Microsoft Azure translation services
Experience working in an AWS cloud environment

Job Description


Analytica is seeking a Data Scientist to support long term federal client engagements projects in the DC Metro area.  The role will apply statistical programming, modeling, visualization techniques, data mining, and forecasting skills to analyze challenging public sector problems. 

This position is fully remote.

Analytica has been recognized by Inc. for 3 consecutive years as one of the 250 fastest growing business.  We offer competitive compensation with opportunities for bonuses, employer paid health care, training and development funds, and 401k match.  

Responsibilities include:

  • Pre-processing - Demonstrate the skills and experience to collect, clean, and prepare data sets for input into a computational model using Python. Strong candidates will explain various methods you have applied using common pre-processing functions such as stop word removal, stemming, lemmatization, and tokenization.
  • Feature Engineering and Attribute Evaluation - Candidate must demonstrate experience with NLP feature engineering methods such as TF-IDF, word2vec, GloVe, and FastText identifying the key determinants for modeling that exist in the business process and within existing data sets as well as selecting evaluation protocols (model techniques).
  • Modeling - Candidates will have practiced skills and experience selecting classification modeling techniques to fit the business problem. Examples will include techniques such as machine learning (ML) supervised and unsupervised learning, regression, neural networks and deep learning, natural language processing, etc.
  • Validation - Strong candidates will describe their experience with investigating, reporting, and justifying model results.
  • Visualization- Experience in presenting the results of their modeling activities, depicting the insights realized, and explaining the relevance of their results to the organization’s business challenges.
Qualifications:
  • Master's degree required, and PhD preferred in Statistics, Mathematics, Computer Science, or similar
  • High degree of experience utilizing SAS, R, or Python to support NLP use cases such as Document Summarization, Named Entity Recognition, Sentiment Analysis, and/or Topic Modeling
  • At least four years of experience developing scalable, production-ready NLP solutions using sci-kit learn, Keras, TensorFlow, PyTorch, Spark NLP.
  • Experience using git/github to version control source code
  • Experience leveraging transformer architecture to develop NLP models
  • Experience with open source NLP packages such as Gensim, SpaCy, or NLTK.
  • Experience with BERT, GPT-J, RoBERTa, T5 or other transformers
  • Experience with GenAI and Prompt Engineering is a plus
  • Experience in Databricks and MLFlow is a plus
  • Experience with machine translation and transcription of foreign language documents using Microsoft Azure translation services is a plus
  • Experience working in an AWS cloud environment and with related AWS services such as Bedrock and Textract
  • Experience coordinating and maintaining user stories
  • Must be a US citizen
  • Must be able to obtain and maintain a Public trust security clearance

About ANALYTICA: Analytica is a leading consulting and information technology solutions provider to public sector organizations supporting health, civilian, and national security missions. Founded in 2009 and headquartered in Bethesda, MD, the company is an established SBA small business that has been recognized by Inc. Magazine each of the past three years as one of the 250 fastest-growing companies in the U.S.  Analytica specializes in providing software and systems engineering, information management, analytics & visualization, agile project management, and management consulting services. The company is appraised by the Software Engineering Institute (SEI) at CMMI® Maturity Level 3 and is an ISO 9001:2008 certified provider. 


Similar Jobs

Explore other opportunities that match your interests

Business Analyst, Makai Labs

Data Science
5h ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Associate

sundayy

United State

Senior Data Analyst - AI-Powered Data Access

Data Science
13h ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

Tremendous

United State

Senior Data Scientist - AI/ML & GenAI

Data Science
14h ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

Cloudflare

United State

Subscribe our newsletter

New Things Will Always Update Regularly