Data Scientist
Library Systems & Services LLC - Beltsville, MD
Job Description
LAC Federal is seeking an Entry-Level Data Scientist to work at a major federal library in the Washington, DC area. The Data Scientist will work with a larger team to develop the information architecture and framework for an Open Access data repository containing scientific data from federally funded research. Under the direction of senior staff, the Data Scientist will be responsible for supporting the management, analysis, and utilization of scientific data within federal agency repositories. This role involves working closely with a team of librarians, information specialists, senior data scientists, data managers, and IT professionals to ensure the effective organization, accessibility, and integrity of scientific datasets. The incumbent will employ various data science techniques and tools to extract insights, support research initiatives, and enhance decision-making processes. This is a hybrid position; both remote and onsite work are required.

RESPONSIBILITIES:

Data Management: Collaborate with data managers to ensure the proper organization, documentation, and storage of scientific datasets. Implement data quality control measures to maintain the accuracy, consistency, and completeness of repository contents. Develop and maintain data pipelines for the efficient extraction, transformation, and loading (ETL) of data from diverse sources.

Data Analysis: Utilize statistical and machine learning techniques to analyze scientific data and extract meaningful insights. Conduct exploratory data analysis (EDA) to identify patterns, trends, and anomalies within large datasets. Develop predictive models to support forecasting, risk assessment, and decision-making processes.

Data Visualization: Create clear and compelling visualizations to communicate findings and insights to stakeholders. Design interactive dashboards and reports to facilitate data exploration and interpretation. Ensure that visualizations adhere to best practices for data presentation and accessibility.

Research Support: Collaborate with scientists and researchers to understand their data needs and provide analytical support for research projects. Assist in the design and execution of experiments and studies, including data collection and analysis. Contribute to the development of data-driven research strategies and methodologies.

Technical Support: Provide technical assistance and training to users of scientific data repositories. Troubleshoot issues related to data access, analysis, and interpretation. Stay abreast of emerging technologies and best practices in data science, informatics, and related fields.

Requirements

Bachelor's degree in data science, computer science, statistics, mathematics, or a related field.
Strong analytical and problem-solving skills, with keen attention to detail.
Proficiency in programming languages commonly used in data science (e.g., Python, R, SQL).
Familiarity with data manipulation and analysis libraries (e.g., pandas, NumPy, scikit-learn).
Experience with data visualization tools and techniques (e.g., Matplotlib, Tableau).
Knowledge of database systems and query languages (e.g., SQL, NoSQL).
Excellent communication and collaboration skills, with the ability to work effectively in a multidisciplinary team environment.
Knowledge of scientific domains relevant to the federal agency's mission (e.g., environmental science, biology, geology) is a plus.
Experience with cloud computing platforms and services (e.g., AWS, Azure, Google Cloud Platform) is desirable.
Familiarity with data governance and compliance standards (e.g., FAIR principles, HIPAA, GDPR) is a plus.

Benefits

Health Care Plan (Medical, Dental & Vision)
Retirement Plan (401k, IRA)
Life Insurance (Basic, Voluntary & AD&D)
Paid Time Off (Vacation, Sick & Public Holidays)
Family Leave (Maternity, Paternity)
Short Term & Long Term Disability
Training & Development
Wellness Resources
Created: 2024-11-05