Data Architect
Genpact - Los Angeles, CA
Apply NowJob Description
Genpact (NYSE: G) is a global professional services and solutions firm delivering outcomes that shape the future. Our 125,000+ people across 30+ countries are driven by our innate curiosity, entrepreneurial agility, and desire to create lasting value for clients. Powered by our purpose - the relentless pursuit of a world that works better for people - we serve and transform leading enterprises, including the Fortune Global 500, with our deep business and industry knowledge, digital operations services, and expertise in data, technology, and AI.Inviting applications for the role of Senior Principal Consultant- Databricks Platform Architect!Locations:California - Bay Area, Los AngelesOregon - Portland Georgia - Atlanta Chicago - ILHouston - TXNY, NJ. We are seeking a highly skilled and experienced Databricks Platform Architect to lead the design, implementation, and optimization of our Databricks platform. The ideal candidate will have a strong background in cloud-based data processing systems, data warehousing, and big data technologies. They will work closely with our data engineering team to ensure that our Databricks platform is optimized for performance, scalability, and reliability.Qualifications we seek in you! Minimum qualifications Proven experience, as a Data Architect, Data Solutions Architect, or similar role in a consulting environment.Hands-on Experience to design platform and build Databricks based solution on cloud platform (Azure, AWS).Excellent technical architecture skills, enabling the creation of future-proof, complex global Platform solutions on Databricks.Excellent interpersonal communication and organizational skills are required to operate as a leading member of global, distributed teams that deliver quality services and solutions.Ability to rapidly gain knowledge of the organizational structure of the firm to facilitate work with groups outside of the immediate technical team.Knowledge and experience in IT methodologies and life cycles that will be used.Familiar with solution implementation/management, service/operations management, etc.Leadership skills - ability to inspire others and persuade.Maintains close awareness of new and emerging technologies and their potential application for service offerings and products.Bachelor's Degree or equivalency (CS, CE, CIS, IS, MIS, or engineering discipline) or equivalent work experience.Experience in a Platform architecture role using service and hosting solutions such as private/public cloud IaaS, PaaS, and SaaS platforms.Experience in architecting and designing technical solutions for cloud-centric solutions based on industry standards using IaaS, PaaS, and SaaS capabilities.Must have strong hands-on experience on various cloud services like ADF/Lambda, ADLS/S3, Security, Monitoring, Governance & Compliance.Must have experience to design platform on Databricks.hands-on Experience to design and build Databricks based solution on any cloud platform. hands-on experience to design and build solution powered by DBT models and integrate with databricks.Must be very good designing End-to-End solution on cloud platform.Must have good knowledge of Data Engineering concept and related services of cloud.Must have good experience in Python and Spark.Must have good experience in setting up development best practices.Good to have knowledge of docker and Kubernetes.Experience with claims-based authentication (SAML/OAuth/OIDC), MFA, RBAC, SSO etc.Knowledge of cloud security controls including tenant isolation, encryption at rest, encryption in transit, key management, vulnerability assessments, application firewalls, SIEM, etc.Experience building and supporting mission-critical technology components with DR capabilities.Experience with multi-tier system and service design and development for large enterprisesExtensive, real-world experience designing technology components for enterprise solutions and defining solution architectures and reference architectures with a focus on cloud technologies.Exposure to infrastructure and application security technologies and approachesFamiliarity with requirements gathering techniques.Preferred qualifications Knowledge and hands of experience of Unity Catalog Implementation (Access Policies, Data Security, Data Discovery, Delta Sharing, SQL Warehouse, Computes, etc).Knowledge of UC Migration tool such as UCX. (Optional)Knowledge of CI/CD deployment and Terraform scriptingKnowledge of Data Engineering concept and related services of cloud.Experience in Python and Spark and setting up development best practices.Knowledge of docker and Kubernetes.Experience with claims-based authentication (SAML/OAuth/OIDC), MFA, RBAC, SSO etc.Knowledge of cloud security controls including tenant isolation, encryption at rest, encryption in transit, key management, vulnerability assessments, application firewalls, SIEM, etc.Must have designed E2E Platform architecture on Databricks covering all the aspect of data lifecycle starting from Data Ingestion, Transformation, Serve and consumption.Must have excellent coding skills either Python or Scala, preferably Python.Must have experience in Data Engineering domainMust have designed and implemented at least 2-3 project end-to-end in Databricks.Strong expertise in Apache Spark, Delta Lake, and other Databricks components for data processing and analytics.Delta lakedb API 2.0SQL Endpoint - Photon engineUnity CatalogSecurity managementPlatform governanceData SecurityProficiency in AWS services including but not limited to S3, EC2, IAM, VPC, EKS, Lambda, Glue, Private Link, KMS, CloudWatch, EMR etc.Must have knowledge of new features available in Databricks and its implications along with various possible use-case.Strong expertise in designing SOX compliant platform architecture.Must know how to manage various Databricks workspace and its integration with other applications.Proficient in designing and implementing Everything as a codeInfrastructure as a codeConfiguration as a codeConfiguration as a codeSecurity configuration as a codeMust have strong expertise in designing platform with strong observability and Monitoring standards.Proficient in setting best practices of various DevSecOps activities including CI/CD.Must have knowledge of Databricks cluster optimization and its integration with various cloud services.Must have strong performance optimization skills to improve efficiency and reduce cost.Must have strong communication skills and have worked with cross platform team.Must have great attitude towards learning new skills and upskilling the existing skills.Responsible to set best practices around Databricks CI/CD.Must understand composable architecture to take fullest advantage of Databricks capabilities.Good to have Rest API knowledge.Good to have understanding around cost distribution.Good to have if worked on migration project to build Unified data platform.Good to have knowledge of DBT.Software development full lifecycle methodologies, patterns, frameworks, libraries, and toolsKnowledge of programming and scripting languages such as JavaScript, PowerShell, Bash, SQL, Java, Python, etc.Genpact is an Equal Opportunity Employer and considers applicants for all positions without regard to race, color, religion or belief, sex, age, national origin, citizenship status, marital status, military/veteran status, genetic information, sexual orientation, gender identity, physical or mental disability or any other characteristic protected by applicable laws. Genpact is committed to creating a dynamic work environment that values diversity and inclusion, respect and integrity, customer focus, and innovation. Get to know us at and on LinkedInXYouTube, and FacebookFurthermore, please do note that Genpact does not charge fees to process job applications and applicants are not required to pay to participate in our hiring process in any other way. Examples of such scams include purchasing a 'starter kit,' paying to apply, or purchasing equipment or training.
Created: 2025-01-24