Software Engineer - Data
Treeswift Inc - new york city, NY
Apply NowJob Description
About Treeswift Treeswift is revolutionizing decision-making for large-scale landscapes and critical infrastructure in the face of rising threats like severe storms and wildfires. After getting our start developing AI models for forests in the timber sector, we are growing in the utility sector. In the U.S. alone, hundreds of thousands of miles of transmission lines and millions of miles of distribution lines need to be monitored and managed to reduce hazards, such as encroaching vegetation that can cause outages or fires. With the rapid growth of renewable energy this infrastructure footprint is expected to more than double in the coming years creating more assets to defend than ever before. Treeswift's mission is to deliver effective scalable technology to manage the landscapes around large-scale distributed infrastructure, such as power lines. We do so by innovating across the technical stack. Data is collected by our sensor packs mounted on backpacks and vehicles to seamlessly integrate into customer operations. We then leverage AI (i.e. computer vision, etc...) to transform collected data into actionable insights delivered to decision-makers in our software platform. We are early in our journey and focused on finding ground truth by working closely with our customers and using this understanding to deliver real value. To tackle this challenge we have brought together mission-driven robotics experts from top academic institutions (Penn, CalTech, etc...) and professionals with deep industry experience (Palantir) in enterprise software development. We hope you'll join us. What you will do Treeswift is looking for a highly skilled and motivated Software Engineer (Data) who will - Stand-up and manage data pipelines from raw inputs to final customer-facing data products. When raw data from our sensors is uploaded there are several transformations before the final customer-facing product. You will take the lead on architecting, implementing and maintaining a pipeline that automates and optimizes these steps. You will consider cost and performance in your design and collaborate with other team members on implementation. When needed, you will engage with internal and external users to scope requirements. Support rapid model development. Dataset and dataset management is key to accelerating our machine learning workflows. You will help facilitate the development of training datasets and model result evaluation. You are not expected to be a machine learning expert, but you will contribute to early infrastructure to accelerate these outcomes. Contribute where you are most needed. We are at an early stage and expect everyone to wear a few hats. You might be asked to help annotate some features in a dataset, contribute a new end-user software feature (with support of course) or join a field visit to collect some sample data. We want you to be excited about learning new things, leveling up others and pitching in where you're needed! This is a full-time, hybrid (2-3 day a week in person) role, based out of our NYC office. Required Skills 4+ years of experience working with data products, pipelines and relevant tooling You have thoughtful opinions about pipeline architecture and relevant tooling. You can guide Treeswift to make decisions that enable near-term outcomes, but also scale well to future needs. Where trade-offs between these objectives exist you can articulate them and make a recommendation. You think critically about database and pipeline design, understand how it supports downstream workflows and potential implications for reliability, scale and performance. Experience with containerized distributed compute and orchestration, ideally with AWS technologies (S3, RDS, ECS, etc...) Experience standing up, migrating, and managing relational databases. You have stood up CD pipelines. You are very proficient in Python Strong communication and collaboration. Preferred Skills Experience designing and implementing basic security protocols, and effectively communicating those to customer counterparts. Experience supporting machine learning workflows and deploying models in production environments. Experience with geospatial and imagery data What We Value Mission first. We value low ego team members who focus on working towards the best outcomes for the customer and the business, regardless of who gets the credit. Truth seeking. We don't always have perfect information to make decisions, but we seek to constantly get closer to the ground truth and aren't afraid to learn we were wrong in the process. Owners mindset. If you see an opportunity for improvement, run with it. We believe that good ideas can come from anywhere, no matter your role. Benefits Comprehensive medical, dental and vision insurance Life insurance package and disability coverage Stock options Paid leave for new parents Unlimited PTO 401K Salary The estimated salary range for this position is $150,000-$180,000, flexible based on experience. Total compensation for this position is determined by skills, qualifications, relevant work experience, location, and other factors. This salary estimate excludes the value of any potential bonuses; the value of any benefits offered; and the potential future value of any long-term incentives. This information is provided per the New York City Human Rights Law. Please note that the range provided is applicable only to New York City-based applicants. Base compensation may vary if the work location is outside of New York City. Treeswift is proud to be an equal opportunity employer. We provide employment opportunities without regard to age, race, color, ancestry, national origin, religion, disability, sex, gender identity or expression, sexual orientation, veteran status, or any other protected status in accordance with applicable law. If you require any accommodations during the recruitment process, whether it be alternate forms of material, accessible meeting rooms, etc., please let us know and we will work with you to meet your needs.
Created: 2024-11-04