Data Engineer PySpark
__jobinformationwidget.freetext.LocationText__
Chennai, Tamil Nadu, India
- Sopra Steria
- Engineering, Development, Applications
- 6 to 10 years
- Standard
- India
About Sopra Steria
Sopra Steria, a major Tech player in Europe with 56,000 employees in nearly 30 countries, is recognized for its consulting, digital services and software development. It helps its clients drive their digital transformation and obtain tangible and sustainable benefits. The Group provides end-to-end solutions to make large companies and organizations more competitive by combining in-depth knowledge of a wide range of business sectors and innovative technologies with a fully collaborative approach. Sopra Steria places people at the heart of everything it does and is committed to putting digital to work for its clients in order to build a positive future for all. In 2023, the Group generated revenues of €5.8 billion.
The world is how we shape it.
We are seeking a highly skilled and motivated Data Engineer to join our dynamic team. As a Data Engineer, you will collaborate closely with our Data Scientists to develop and deploy machine learning models. Proficiency in below listed skills will be crucial in building and maintaining pipelines for training and inference datasets.
Responsibilities:
• Work in tandem with Data Scientists to design, develop, and implement machine learning pipelines.
• Utilize PySpark for data processing, transformation, and preparation for model training.
• Leverage AWS EMR and S3 for scalable and efficient data storage and processing.
• Implement and manage ETL workflows using Stream sets for data ingestion and transformation.
• Design and construct pipelines to deliver high-quality training and inference datasets.
• Collaborate with cross-functional teams to ensure smooth deployment and real-time/near real-time inferencing capabilities.
• Optimize and fine-tune pipelines for performance, scalability, and reliability.
• Ensure IAM policies and permissions are appropriately configured for secure data access and management.
• Implement Spark architecture and optimize Spark jobs for scalable data processing.
Requirements:
Mandatory
• Proficiency in Advanced SQL (Window functions), Spark Architecture, Pyspark or Scala with Spark, Hadoop.
• Proven expertise in designing and deploying data pipelines.
• Strong problem-solving skills and ability to work effectively in a collaborative team environment.
• Excellent communication skills and ability to translate technical concepts to non-technical stakeholder
Desirable
• Hands-on experience with Airflow, S3, and Stream sets or similar ETL tools. [ can be trained locally ]
• Understanding of real-time or near real-time inferencing architectures.
- •Basic Knowledge on Kafka ,AWS IAM, AWS EMR and Snowflake.
Total Experience Expected: 06-08 years
BE
At our organization, we are committed to fighting against all forms of discrimination. We foster a work environment that is inclusive and respectful of all differences.
All of our positions are open to people with disabilities.
Discover what working at Sopra Steria looks like...
Are you looking for a place where you can free your creativity and take initiatives, supported by tech experts?
Join us on this adventure where every idea counts and every talent steps up.
Job offers that might interest you
Salary
Location
Chennai, Tamil Nadu, India
Job Type
Standard
Experience Level
6 to 10 years
Department
Engineering, Development, Applications
Brand
Sopra Steria
Location
India
Description
Role: Data Quality Analyst,Skillset: Python, SQL, Tableau, Data VisualizationExperience: 6-8 yearsLocation: Noida, Chennai & PuneDomain: Credit Risk/BankingRoles & ResponsibilitiesPerform the anal
Reference
1901cd3d-5f0c-46be-8e5c-2a41e066d172
Expiry Date
Jan 1, 0001
Author
Sarah SmithAuthor
Sarah SmithSalary
Location
Chennai, Tamil Nadu, India
Job Type
Standard
Experience Level
6 to 10 years
Department
Engineering, Development, Applications
Brand
Sopra Steria
Location
India
Description
Role: Data Analyst,Skillset: Python, SQL, Tableau, Data VisualisationExperience: 6-8 yearsLocation: Noida, Chennai & PuneRoles & ResponsibilitiesExperience working in large Software Development TeamsK
Reference
ba83c423-81b6-40dc-8b39-fb3ba46ea863
Expiry Date
Jan 1, 0001
Author
Sarah SmithAuthor
Sarah SmithSalary
Location
Bengaluru, Karnataka, India
Job Type
Standard
Experience Level
6 to 10 years
Department
Engineering, Development, Applications
Brand
Sopra Steria
Location
Bengaluru, Karnataka
Description
Must have:ConfluentKafkaNode/Java/.NET - >=3 Years experience Role Expectation:We are looking for an experienced Confluent Kafka Administrator/ Developer to assist us in managing and supporting our Ka
Reference
7ce2184d-9afa-4bd8-9e57-dbccd4d0df52
Expiry Date
Jan 1, 0001
Author
Sarah SmithAuthor
Sarah SmithSalary
Location
Noida, Uttar Pradesh, India
Job Type
Standard
Experience Level
6 to 10 years
Department
Engineering, Development, Applications
Brand
Sopra Steria
Location
Noida, Uttar Pradesh
Description
Roles and Responsibilities:Install, configure and manage high availability Fusion Middleware components in production environmentsInstall, configure and manage connectors for integration with AD and o
Reference
a2314511-9f06-438c-9ee3-8e39476a86a5
Expiry Date
Jan 1, 0001
Author
Sarah SmithAuthor
Sarah SmithSalary
Location
Bengaluru, Karnataka, India
Job Type
Standard
Experience Level
6 to 10 years
Department
Engineering, Development, Applications
Brand
Sopra Steria
Location
Bengaluru, Karnataka
Description
Primary: Python, React, HTML, CSS, FastAPIGood knowledge of Agile/SAFe. Secondary: Microservice, Full stack frameworks Good to have:Knowledge on AWS (lamda & API gateway), Pytest is an advantageTotal
Reference
e09757b8-1815-4946-8bdd-a87774fb58cb
Expiry Date
Jan 1, 0001
Author
Sarah SmithAuthor
Sarah SmithSalary
Location
Bengaluru, Karnataka, India
Job Type
Standard
Experience Level
6 to 10 years
Department
Engineering, Development, Applications
Brand
Sopra Steria
Location
Bengaluru, Karnataka
Description
Primary: Core java, J2EE, mulithreading, java server faces, Rest Api, SQL, linux shell scripting, Git, Eclipse/IntellijaGood knowledge of Agile/SAFe. Secondary: AWS Basics, Python, Jasper reporting, M
Reference
8a4b7af8-3f8a-4a91-a8d5-b45c8a2f3205
Expiry Date
Jan 1, 0001
Author
Sarah SmithAuthor
Sarah SmithSalary
Location
Bengaluru, Karnataka, India
Job Type
Standard
Experience Level
6 to 10 years
Department
Engineering, Development, Applications
Brand
Sopra Steria
Location
Bengaluru, Karnataka
Description
5-8 years of experience in DB admin and development. Must have:Good experience with SQL queries, PLSQL and SQL scriptingGood knowledge in SQL, Normalization, constraints, keys, joinsGood understanding
Reference
1f4058aa-1748-47a4-9181-226d5a0a32ae
Expiry Date
Jan 1, 0001
Author
Sarah SmithAuthor
Sarah SmithSalary
Location
Noida, Uttar Pradesh, India
Job Type
Standard
Experience Level
6 to 10 years
Department
Engineering, Development, Applications
Brand
Sopra Steria
Location
Noida, Uttar Pradesh
Description
Roles and Responsibilities:Administration of MFT Axway solutions including Secure Transport, XFB Gateway, CFT, and Sentinel.Design detailed migration plans, including timelines, dependencies, and roll
Reference
e435c00f-c655-4b29-abcf-efdfd26e0b8d
Expiry Date
Jan 1, 0001
Author
Sarah SmithAuthor
Sarah SmithSalary
Location
Noida, Uttar Pradesh, India
Job Type
Standard
Experience Level
6 to 10 years
Department
Engineering, Development, Applications
Brand
Sopra Steria
Location
Noida, Uttar Pradesh
Description
· Minimum of 6-8 years of SD experience in full cycle implementation as well as in support projects.· Minimum 1-2 E2E SD Implementation experience.· Knowledge of working experience of S
Reference
957eb069-43cf-4289-9d23-388c590e4ad7
Expiry Date
Jan 1, 0001
Author
Sarah SmithAuthor
Sarah SmithSalary
Location
Bengaluru, Karnataka, India
Job Type
Standard
Experience Level
6 to 10 years
Department
Engineering, Development, Applications
Brand
Sopra Steria
Location
Bengaluru, Karnataka
Description
Primary: Core java, Swing, Corba, SQL, linux Shell scripting, eclipse/intellij, Git.Good knowledge of Agile/SAFe. Secondary: Python, Jasper reporting, Microservices, html, jsfTotal Experience Expected
Reference
99fef918-462f-4254-8049-965de366398d
Expiry Date
Jan 1, 0001
Author
Sarah SmithAuthor
Sarah Smith