Data Scientist 3
Huntington Ingalls Industries - Arlington, VA
Job Description
Location: Arlington, VA, Alabama, United States
Requisition Number: 21796
Required Travel: 0 - 10%
Employment Type: Full Time/Salaried/Exempt
Anticipated Salary Range: $114,236.00-$160,000.00
Security Clearance: Secret
Level of Experience: Senior

This opportunity resides with Warfare Systems (WS), a business group within HII's Mission Technologies division. Warfare Systems comprises cyber and mission IT; electronic warfare; and C5ISR systems. HII works within our nation's intelligence and cyber operations communities to defend our interests in cyberspace and anticipate emerging threats. Our capabilities in cybersecurity, network architecture, reverse engineering, software and hardware development uniquely enable us to support sensitive missions for the U.S. military and federal agency partners.

Meet HII's Mission Technologies Division

Our team of more than 7,000 professionals worldwide delivers all-domain expertise and advanced technologies in service of mission partners across the globe. Mission Technologies is leading the next evolution of national defense - the data evolution - by accelerating a breadth of national security solutions for government and commercial customers.

Come join our growing team today, supporting our Warfare Systems Group! HII-Mission Technologies is currently seeking a skilled Data Scientist who will support refining a centralized data environment (CDE) with impact across DoD! Key functionality for this position contributes toward auditable financial transaction data and toward building a common operating picture for leadership. A successful candidate will be well-versed in Python and relational databases (SQL), have strong written and oral communication skills, and be able to work as part of a team, providing productive input toward solving challenging problems.
What you will do

- Manage the BPC Decoupling Dashboard (the data infrastructure, Databricks codebase, and Qlik application) to support sustainment, stability, performance, and security of the platform
- Advana Role: Lead Builder (applies to both Databricks and Qlik Sense)
- Monitor, maintain, configure, and audit the data pipelines in conjunction with the Databricks platform to ensure that all data sources are ingesting as intended, the dashboard is functioning as intended, and data quality standards remain high
- Develop Qlik dashboards with data analytics and visualization, with emphasis on data-driven business intelligence and UI/UX practices, and on data transformation via calculated fields and expressions
- Interface with stakeholders to gather requirements and translate them into actionable development goals
- Oversee and manage data loading scripts and data modeling design in Qlik
- Manage connections to data sources and pipelines, and their scheduled ingest jobs in Databricks and the Advana Data Catalog
- Additional duties as assigned or required

Minimum Qualifications

- 5 years relevant experience with Bachelors in related field; 3 years experience with Masters in related field; or High School Diploma or equivalent and 9 years relevant experience
- Experience with Python and SQL, especially with a focus on data pipeline engineering, relational databases, and analytics
- Experience with data visualization and business intelligence tools (Tableau, Qlik, PowerBI)
- Understanding of ETL (Extract, Transform, Load) with various types of structured and unstructured data sources and systems
- Experience developing code in Databricks or a similar notebook-style data warehouse platform
- Proficiency in creating calculated fields and expressions to create dynamic dashboards
- Strong understanding of data modeling concepts
- Experience working in Agile/Scrum environments, participating in sprints, and collaborating closely with development teams
- Clearance: Must possess and maintain a Secret clearance

- Qlik Sense or Databricks certifications
- Understanding of Advana Databricks' Gold/Silver/Bronze data zones and Delta Lakehouse architecture
- Solid understanding of object-oriented programming and efficient code development practices
- Knowledge of Apache Spark and similar frameworks to support streaming data
- Experience with some of the following Python libraries: NumPy, Pandas, PySpark, Dask, Apache Airflow, Luigi, SQLAlchemy, Great Expectations, Petl, Boto3, matplotlib, dbutils, koalas, OpenPyXL, XlsxWriter
- Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack, Splunk)
- Experience creating and managing scripts, scheduled jobs, and automation
- Experience with unit testing, integration testing, or creating test scripts
- Domain knowledge of data sources and reports and their stakeholders, an understanding of the transactional nature of the data, and experience with FMS (Foreign Military Sales)

Physical Requirements

May require working in an office, industrial, or laboratory environment.
Created: 2025-02-01