Senior Data Scientist
Harvard University - Cambridge, MA
Apply NowJob Description
Location: USA - MA - Cambridge Business Title: Senior Data Scientist Salary Grade: 058 Time Status: Full-time Union: Non Union, Exempt or Temporary Additional Qualifications and Skills: Bachelor's or Master's Degree in Statistics, Data Science, Computer Science, Mathematics, Informatics, or other health data related field. Prior work within a research environment is essential, including familiarity with the research pipeline and the process of conducting research employing accepted scientific experimental practices. Knowledgeable of data engineering, data architecture, database management, and data visualization techniques, with high proficiency in data extraction and wrangling. Numerical methods, statistical analysis, machine learning, and deep learning. Experience fitting and interpreting a range of models, including GLM, GLMM, SEM, econometric models, machine learning models. The ability to create and maintain databases using libraries from Python, R in a Linux environment. Expertise in leveraging cloud platforms and high-performance computing environments, especially Amazon Web Services (AWS), for scalable data processing and analytics (EC2, S3, Redshift, Lambda), and machine learning tools (SageMaker, Glue, Athena) and cluster management and scheduling systems (e.g., slurm). Experience with data warehousing and ETL/ELT processes. Skilled in SQL, NoSQL databases, and data modeling techniques. Experience with big data technologies and ecosystems (e.g. Hadoop, Spark). CI/CD pipelines for data science projects and their reliable deployment. Assisting with release of models/products to the proper platform (e.g., a website, an interactive API, etc.), including infrastructure design. Background in scientific programming/scripting (Python, R, Stata, and C++). 3+ years of experience using either Python or R in a data science and/or research context required; 5+ years preferred with advanced skills in Python libraries for data science (Pandas, NumPy, sci-kit learn, TensorFlow/PyTorch) or experience using object-oriented programming systems in R (e.g., S3, S4, RC, R6). Adherence to best practices in scientific programming, including version control (Git), code review, unit testing, and documentation to ensure reproducibility and maintainability of data science projects. Proven track record of success in working in a cross-functional team in an agile environment. Excellent communication skills; able to simplify complex technical concepts to stakeholders. Detail-oriented expertise, with strong problem-solving skills to support research. Strong team player with a service mindset, able to guide researchers and is customer focused. Awareness of and aptitude to appropriately and effectively understand, respect, and adapt to cultural and identity"based differences within group environments. Additional Information: Candidates who move forward in the process may be asked to complete a coding exercise as part of the interview. Please note: Harvard University requires pre-employment reference and background screening. The Harvard Data Science Initiative is unable to provide work authorization and/or visa sponsorship. This position has a 90-day orientation and review period. This is a fully benefited, full-time Harvard University position that has been funded through 7/31/2027. There is the possibility of renewal, contingent on funding, university priorities and satisfactory job performance. Commitment to Equity, Inclusion, and Belonging: Harvard University views equity, inclusion, and belonging as the pathway to achieving inclusive excellence and fostering a campus culture where everyone can thrive. Benefits: We invite you to visit Harvard's Total Rewards website to learn more about our outstanding benefits package. Work Format: Hybrid (partially on-site, partially remote). While this position is primarily remote, travel to campus may be necessary based on business needs and the nature of work. #J-18808-Ljbffr
Created: 2025-03-10