Senior Member of Technical Staff
Oracle - Seattle, WA
Apply NowJob Description
At Oracle Cloud Infrastructure (OCI), we build the future of the cloud for Enterprises as a diverse team of fellow creators and inventors. We act with the speed and attitude of a start-up, with the scale and customer-focus of the leading enterprise software company in the world. Oracle Generative AI Service is an exciting team in Oracle Cloud Infrastructure. We are delivering innovative services at the intersection of artificial intelligence and cloud infrastructure. In Generative AI Service team, you will build and operate massive-scale cloud services leveraging state of art machine learning technologies. We are committed to providing the best in cloud products to meet the needs of our customers who are tackling some of the world's most challenging problems. You will be part of a team of smart, hands-on machine learning engineers with the expertise and passion to solve difficult problems in distributed highly available services and virtualized infrastructure. At every level, our engineers have a significant technical and business impact by designing and building innovative new systems to power our customer's business critical applications. What we offer: Being part of one of the most visionary and mission-driven organizations in Oracle, cooperating with talented peers with diverse backgrounds worldwide. High visibility to senior leadership, opportunity to make huge impacts across organizations. Opportunity to build state-of-the-art technologies in large language models (LLM) and generative AI at scale to solve real business problems. Close partnership with applied scientists and software engineers to deploy solutions into production in various business-critical scenarios. About You: You are an experienced machine learning engineer with a proven track record of delivering large-scale, high-performance model serving/training systems in production. You are obsessed with customers and exceeding their expectations. You have excellent communication skills and you can clearly explain complex technical concepts. You are a disciplined engineer who understands the importance of high standards, never satisfied with mediocrity and constantly striving for excellence. You are passionate about technology and self-motivated to stay updated with latest developments in machine learning related technologies. Minimum Qualifications BS in Computer Science, or equivalent experience. 3-5+ years of experience shipping scalable, cloud-native distributed systems. Ability to work in a collaborative, cross-functional team environment. Proficient in Python and shell scripting tools. Experience with container orchestration technologies like Kubernetes. Experience with production operations and best practices for putting quality code in production and troubleshoot issues when they arise. Able to effectively communicate technical ideas verbally and in writing (technical proposals, design specs, architecture diagrams and presentations). Preferred Qualifications MS in Computer Science. Familiarity with micro-services architecture. Experience with Large Language Model (LLM) serving technologies like DeepSpeed, FasterTransformer etc. Experience with deploying AI models to production. Experience in diagnosing, troubleshooting and resolving issues in AI model training and serving. Career Level - IC3 Qualifications Disclaimer: Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates. Range and benefit information provided in this posting are specific to the stated locations only. US: Hiring Range in USD from: $79,000 to $178,100 per annum. May be eligible for bonus and equity. Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business. Oracle US offers a comprehensive benefits package which includes the following: Medical, dental, and vision insurance, including expert medical opinion. Short term disability and long term disability. Life insurance and AD&D. Supplemental life insurance (Employee/Spouse/Child). Health care and dependent care Flexible Spending Accounts. Pre-tax commuter and parking benefits. 401(k) Savings and Investment Plan with company match. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. 11 paid holidays. Paid sick leave: 72 hours of paid sick leave upon date of hire. Paid parental leave. Adoption assistance. Employee Stock Purchase Plan. Financial planning and group legal. Voluntary benefits including auto, homeowner and pet insurance. The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted. Responsibilities As a Machine Learning Engineer in Generative AI Service team, you will be leading the effort of building distributed, scalable, high-performance AI model training, finetuning and serving systems in partnership with our applied scientists and software engineers. You will dive deep into model structure to optimize model performance and scalability. You will build state of art systems with cutting-edge technologies in this fast evolving area. You will benchmark, diagnose, troubleshoot and resolve issues in AI model training, finetuning and serving. You may also perform other duties as assigned. #J-18808-Ljbffr
Created: 2025-03-01