Python Spark AWS @ Columbus, OH

Diverse Lynx - columbus, OH

Apply Now

Job Description

Job Title : Python Spark AWS Location : Columbus, OH - non local candidates accepted Job Responsibilities: Develop and maintain data platforms using Python, Spark, and PySpark. Handle migration to PySpark on AWS. Design and implement data pipelines. Work with AWS and Big Data. Produce unit tests for Spark transformations and helper methods. Create Scala/Spark jobs for data transformation and aggregation. Write Scaladoc-style documentation for code. Optimize Spark queries for performance. Integrate with SQL databases (e.g., Microsoft, Oracle, Postgres, MySQL). Understand distributed systems concepts (CAP theorem, partitioning, replication, consistency, and consensus). Skills: Proficiency in Python, Scala (with a focus on functional programming), and Spark. Familiarity with Spark APIs, including RDD, DataFrame, MLlib, GraphX, and Streaming. Experience working with HDFS, S3, Cassandra, and/or DynamoDB. Deep understanding of distributed systems. Experience with building or maintaining cloud-native applications. Familiarity with serverless approaches using AWS Lambda is a plus Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.

Created: 2024-11-05

➤

Login

Create Account

Python Spark AWS @ Columbus, OH