Data Scientist
Massachusetts General Hospital(MGH) - boston, MA
Apply NowJob Description
GENERAL SUMMARY/ OVERVIEW STATEMENT: The Albers Lab is a research laboratory of the MassGeneral Institute for Neurodegenerative Disease (MIND) that develops and applies integrative computational methods in biomedical and brain research to develop new therapeutic strategies for neurodegenerative diseases. We are seeking a highly motivated, innovative, and independent Data Scientist to be part of our team. While the position is primarily computational, the successful candidate will be working with a highly interdisciplinary team of computational, clinical and bench researchers at the Mass General Hospital and Harvard Medical School providing bioinformatics support for translational projects. You will have the opportunity to analyze both internal data (cellular and iPSC-derived neuron models of TDP43+ ALS and Alzheimer's disease and Related Dementias) and external datasets (MGH-ADRC, UK Biobank, NYGC, electronic health record data from MGB, UK, and ENACT enclave etc) to assess the human relevance of disease phenotypes and pharmacological rescue in our cellular, iPSC-derived neuron and mouse models of disease. The Data Scientist will also interact with lab members emulating clinical trials in electronic health records using federated learning of target trial emulation with collaborators conducting molecular dynamic simulation for drug discovery, and with medicinal chemists conducting in silico screening. The Data Scientist will lead the standardization of bioinformatic pipelines and data management of data being generated across the teams. Scientific insights resulting from this research are expected to be presented at external scientific conferences and published in high impact journals. PRINCIPAL DUTIES AND RESPONSIBILITIES: • Establishing and standardizing bioinformatics pipelines relevant to translational project • Analysis of genomic, transcriptomic (scRNA/bulk RNA), proteomic internal and external data set • Further develop DRIAD (Drug Repurposing in Alzheimer's Disease) platform by incorporating TDP43 driven cryptic exon detection/staging • Development of TRIALS (Therapeutic Repurposing in ALS) platform by using machine learning to develop predictors of ALS disease progression. • Provide support for federated learning efforts to emulate clinical trials using electronic health records from US, Europe and Asia. • Prepare data packages for regulatory bodies, such as NIH and FDA. • Build and process input datasets for machine-learning models. • Write code using a collaborative version control system, ensuring proper documentation and reproducible workflows. • Work closely with other computational scientists, researchers and physicians to design and perform analyses. • Assist in preparation of manuscripts as well as abstracts and presentations for scientific meetings. • Lead harmonization of data schema across different data ecosytems and data management across group Qualifications SKILLS/ABILITIES/COMPETENCIES REQUIRED: • Independent, highly motivated, and highly collaborative with the ability to work together with multi-disciplinary teams of computational and clinical researchers as well as laboratory biologist • Enthusiastic about working in a drug discovery and development centric scientific environment • Curious and quick learner, with a willingness to explore about new areas and build expertise, takes initiative to see your ideas implemented • Strong programming skills in R and Python are required. • Comfortable using Linux environments. • Experience with statistical analysis and databases is strongly preferred. • Experience in multi-omics datasets is required • Excellent organizational and communication skills - demonstrated ability to work well within multi-disciplinary teams. • Ability to work independently and take initiative when necessary. • Highly motivated and able to meet deadlines. • Knowledge of neuroscience is desirable. EDUCATION: Bachelor's Degree required. PhD in Computer Science, Bioinformatics, or related quantitative discipline is preferred. EXPERIENCE: 0-2 years of experience in data science projects is required. SUPERVISORY RESPONSIBILITY: 1 Bachelor's level Data Analyst. WORKING CONDITIONS: Work will be performed in a typical office setting with remote work as a possibility. EEO Statement Massachusetts General Hospital is an Affirmative Action Employer. By embracing diverse skills, perspectives and ideas, we choose to lead. All qualified applicants will receive consideration for employment without regard to race, color, religious creed, national origin, sex, age, gender identity, disability, sexual orientation, military service, genetic information, and/or other status protected under law. We will ensure that all individuals with a disability are provided a reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment.
Created: 2024-11-05