Data Scientist
Insight Global - Nashville, TN
Apply NowJob Description
This range is provided by Insight Global. Your actual pay will be based on your skills and experience "” talk with your recruiter to learn more. Base pay range $120,000.00/yr - $140,000.00/yr Salary : $120-140k (depending on experience) A client of Insight Global is searching for a Data Scientist to join their team. This candidate: Designs and implements Natural Language Processing (NLP) pipelines for preprocessing and analyzing "noisy" or incomplete text data. Utilizes embeddings (e.g., word, sentence, multilingual) for semantic similarity and feature engineering in record linkage workflows. Updates NLP models for domain-specific tasks, such as abbreviation recognition and title normalization. Develops and trains machine learning models for match/no-match classification. Optimizes hyperparameters and enhances model performance. Deploys NLP and Machine Learning (ML) models into batch and streaming pipeline using Databricks. Manages model lifecycle, including versioning, deployment, and monitoring. Implements monitoring solutions to detect model drift and continuously refine solutions based on real-world performance. Collaborates with Data Analysts to extract actionable insights from datasets, including text data. Collaborates with Data Engineers to integrate NLP and ML models into scalable Extract, Transform, Load (ETL) pipelines. Partners with stakeholders to align technical solutions with business needs. Explores cutting edge NLP approaches, such as transformer-based models, for improving text matching. Evaluates new tools and frameworks, including vector databases, to enhance the AI/ML pipeline. Researches multilingual and cross-lingual NLP solutions for entity resolution. Desired Skillset: 5+ years of experience with NLP techniques, including tokenization, embedding generation, and text similarity measures. 5+ years of experience working in a large data environment. Proficiency in PySpark MLlib and Python libraries like Scikit-learn, TensorFlow, or PyTorch. Familiarity with transformer-based models (e.g., BERT, RoBERTa) for text representation and fine-tuning. Proficiency in SQL for querying and transforming large datasets. Seniority level Not Applicable Employment type Full-time Job function Information Technology #J-18808-Ljbffr
Created: 2025-03-01