Data Engineer
Connecticut Innovations - Jersey City, NJ
Job Description
Are you ready to join Connecticut Innovations' vibrant community of innovators? Connecticut Innovations ("CI") is Connecticut's strategic venture capital arm, and we are passionate about serving our portfolio of 220+ companies across various industries, with strengths in life sciences, technology, and climate tech. Come join Noteworthy AI (Routine fleet operations. Extraordinary grid insights.) as a Data Engineer!

Noteworthy AI Overview

At Noteworthy AI, our mission is to improve the reliability, resiliency, and safety of the electric grid. Our vehicle-mounted cameras and AI help utilities and other grid operators increase their situational awareness of their assets while reducing costs. Our platform autonomously geolocates, photographs, and analyzes grid infrastructure as vehicles drive during routine operations, enabling more proactive grid management.

We've gained significant market traction, validation, and support from customers like Florida Power & Light, FirstEnergy Corp, and Alabama Power, investors like Earthshot Ventures and Techstars, and partners like Nvidia, so we are looking for great people to come and join our growing team!
🚀 We plan to expand our office space in New Haven, CT, later this year to enable more in-person collaboration and expect this role to follow a hybrid schedule.

About You

- You are excited to roll up your sleeves at a fast-growing startup that is playing a critical role in helping to keep the electric grid energized and resilient.
- You're experienced in writing Python and in enabling machine-learning model development by processing and handling data.
- You want to grow your career by working with a dynamic research team on cutting-edge applied AI research and development, contributing to novel ML research, and learning key skills for AI and ML engineering.

Responsibilities

- Process and handle data to enable machine-learning (ML) and AI development tasks (e.g., dataset curation, labeling, training, inference, evaluation) and maintain a traceable, automated ML operations workflow
- Lead analyses and experiments to identify salient data for ML development and characterize the performance of ML models
- Design, build, and improve data pipelines, interfaces, visualizations, and associated code infrastructure
- Maintain databases and data lakes for ML development while ensuring data integrity and security
- Support and interface with internal stakeholders to generate high-quality deliverables for customers
- Contribute to internal documentation, code, and data standards and tooling

Minimum Qualifications

- Bachelor's degree in Computer Science, Engineering, Mathematics, Statistics, or a related field OR commensurate experience in software development, data science, math, and statistics
- Strong proficiency and recent experience in Python and standard data science libraries (numpy, pandas, scikit-learn, matplotlib, seaborn)
- Demonstrated ability to write efficient and reliable software following best practices in software design, testing, review, and documentation
- Strong technical communication skills
- Ability to collaborate and work effectively on complex software systems in a team setting
- A growth mindset, a willingness to take ownership of your work, and an ability to adapt to the challenges of a fast-paced startup environment

Preferred Qualifications

- Experience designing data pipelines, handling large volumes of data, and generating compelling visualizations
- Experience in SQL, relational databases, and/or related topics (e.g., database design, query optimization, NoSQL, RDBMS)
- Experience with Amazon Web Services (AWS) products (e.g., S3, Redshift, DynamoDB, Lambda, SageMaker) or equivalent cloud services (Microsoft Azure, Google Cloud Platform)
- Experience and/or strong technical foundations in computer vision, such as cameras and image capture, image encoding and storage, image processing and filtering, 3D vision, and feature extraction
- Experience and/or strong technical foundations in machine learning, supervised learning, optimization, neural networks, and applications in computer vision (image classification, object detection, semantic segmentation, keypoint detection, tracking, etc.)
- Hands-on experience writing ML training and/or inference code in Python and common libraries (e.g., PyTorch, TensorFlow, Keras, scikit-learn, Hugging Face, Weights & Biases, TensorBoard)
- Knowledge of ML operations and best practices for production-grade ML model development

What We Offer

- Competitive salary, equity, and benefits
- Opportunity to make an impact with AI in the increasingly important energy sector
- Professional development and leadership opportunities
- Flexible work hours in a hybrid setting

Location: This is a hybrid role. We prefer Connecticut-based candidates who can work from our office in New Haven several days per week.

Diversity, Equity and Inclusiveness

Noteworthy AI is committed to building an inclusive organization that reflects the diverse communities our team works to serve.
We believe that diversity in all its forms (gender, race, ethnicity, age, sexual orientation, religion, veteran's status, disability and more) is essential to imagining and actively building a more just and sustainable future for all. We also actively promote diversity outside our organization, through the partnerships we enter into and the business decisions we make.
Created: 2025-02-21