• Expertini Resume Scoring: Our Semantic Matching Algorithm evaluates your CV/Résumé before you apply for this job role.
India Jobs Expertini

Site Reliability Engineer II (Apache Spark, Python, AWS/GCP) Job Opening In Karnataka – Now Hiring American Express

Site Reliability Engineer II (Apache Spark, Python, AWS/GCP)

    India Jobs Expertini Expertini India Jobs Karnataka Other General Site Reliability Engineer Ii (Apache Spark, Python, Aws/gcp)

Job description

**Description**
**You Lead the Way.

We’ve Got Your Back.**
With the right backing, people and businesses have the power to progress in incredible ways.

When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other.

Here, you’ll learn and grow as we help you create a career journey that’s unique and meaningful to you with benefits, programs, and flexibility that support you personally and professionally.
At American Express, you’ll be recognized for your contributions, leadership, and impact—every colleague has the opportunity to share in the company’s success.

Together, we’ll win as a team, striving to uphold our company values and powerful backing promise to provide the world’s best customer experience every day.

And we’ll do it with the utmost integrity, and in an environment where everyone is seen, heard and feels like they belong.
Join Team Amex and let's lead the way together.
**How will you make an impact in this role?**
We are seeking an experienced Site Reliability Engineer to join our Big Data infrastructure team.

This role focuses on ensuring the reliability, scalability, and performance of our Apache Spark-based data processing systems and broader big data ecosystem.

The ideal candidate will have 5 years of hands-on experience with distributed systems, data platforms, and SRE practices.
**Key Responsibilities:**
**Infrastructure Management & Reliability**
+ Design, implement, and maintain highly available Apache Spark clusters and big data infrastructure across cloud and on-premises environments
+ Monitor and optimize performance of distributed data processing workloads, ensuring SLA compliance and minimal downtime
+ Implement comprehensive monitoring, alerting, and observability solutions for big data pipelines and infrastructure components
+ Lead incident response and post-mortem analysis for data platform outages, implementing preventive measures to avoid recurrence
**Automation & Operations**
+ Develop and maintain Infrastructure as Code (IaC) solutions using tools like Terraform, Ansible, or CloudFormation for big data infrastructure provisioning
+ Build automated deployment pipelines and CI/CD workflows for Spark applications and data platform components
+ Create and maintain runbooks, operational procedures, and disaster recovery plans for critical data systems
+ Implement capacity planning and auto-scaling solutions to handle varying data processing workloads efficiently
**Platform Engineering & Optimization**
+ Collaborate with data engineering teams to optimize Spark job configurations, cluster sizing, and resource allocation
+ Design and implement data platform governance, security, and compliance measures
+ Evaluate and integrate new big data technologies and tools to improve platform capabilities and performance
+ Establish best practices for code deployment, configuration management, and system maintenance
**Required Skills and Experience:**
**Technical Expertise**
+ 5 years of experience in Site Reliability Engineering, DevOps, or similar roles with focus on distributed systems
+ Deep hands-on experience with Apache Spark (Scala, Python/PySpark) and Spark cluster management (YARN, Kubernetes, or standalone)
+ Proficiency with big data ecosystem technologies including Hadoop, HDFS, Hive, Kafka, Airflow, and data lakes/warehouses
+ Strong experience with cloud platforms (AWS, GCP, or Azure) and their big data services (EMR, Dataproc, HDInsight, etc.)
+ Advanced knowledge of containerization technologies (Docker, Kubernetes) and orchestration in data processing contexts
**Infrastructure & Monitoring**
+ Experience with infrastructure monitoring and observability tools (Prometheus, Grafana, ELK stack, Datadog, or similar)
+ Proficiency in Infrastructure as Code tools (Terraform, CloudFormation, Ansible) for managing big data infrastructure
+ Strong Linux/Unix system administration skills and experience with configuration management tools
+ Knowledge of networking, security, and performance tuning in distributed computing environments
**Programming & Automation**
+ Proficient in at least one programming language (Python, Scala, Java, or Go) for automation and tooling development
+ Experience with CI/CD pipelines and version control systems (Git, Jenkins, GitLab CI, or similar)
+ Strong scripting skills (Bash, Python) for automation and operational tasks
+ Understanding of software engineering best practices including testing, code review, and documentation
**Preferred Qualifications**
+ Experience with stream processing frameworks (Kafka Streams, Apache Flink, or Spark Streaming)
+ Knowledge of data governance, data quality, and data lineage tools
+ Familiarity with machine learning operations (MLOps) and model deployment at scale
+ Experience with database technologies (SQL, NoSQL) and data warehouse solutions
+ Relevant certifications in cloud platforms or big data technologies
**Qualifications**
We back you with benefits that support your holistic well-being so you can be and deliver your best.

This means caring for you and your loved ones' physical, financial, and mental health, as well as providing the flexibility you need to thrive personally and professionally:
+ Competitive base salaries
+ Bonus incentives
+ Support for financial-well-being and retirement
+ Comprehensive medical, dental, vision, life insurance, and disability benefits (depending on location)
+ Flexible working model with hybrid, onsite or virtual arrangements depending on role and business need
+ Generous paid parental leave policies (depending on your location)
+ Free access to global on-site wellness centers staffed with nurses and doctors (depending on location)
+ Free and confidential counseling support through our Healthy Minds program
+ Career development and training opportunities
American Express is an equal opportunity employer and makes employment decisions without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran status, disability status, age, or any other status protected by law.
Offer of employment with American Express is conditioned upon the successful completion of a background verification check, subject to applicable laws and regulations.
**Job:** Technology
**Primary Location:** India-Karnataka-Bengaluru Urban
**Schedule** Full-time
**Req ID:** 25014455

Required Skill Profession

Other General


  • Job Details

Unlock Your Site Reliability Potential: Insight & Career Growth Guide


Real-time Site Reliability Jobs Trends (Graphical Representation)

Explore profound insights with Expertini's real-time, in-depth analysis, showcased through the graph here. Uncover the dynamic job market trends for Site Reliability in Karnataka, India, highlighting market share and opportunities for professionals in Site Reliability roles.

7879 Jobs in India
7879
14 Jobs in Karnataka
14
Download Site Reliability Jobs Trends in Karnataka and India

Are You Looking for Site Reliability Engineer II (Apache Spark, Python, AWS/GCP) Job?

Great news! is currently hiring and seeking a Site Reliability Engineer II (Apache Spark, Python, AWS/GCP) to join their team. Feel free to download the job details.

Wait no longer! Are you also interested in exploring similar jobs? Search now: .

The Work Culture

An organization's rules and standards set how people should be treated in the office and how different situations should be handled. The work culture at American Express adheres to the cultural norms as outlined by Expertini.

The fundamental ethical values are:

1. Independence

2. Loyalty

3. Impartiapty

4. Integrity

5. Accountabipty

6. Respect for human rights

7. Obeying India laws and regulations

What Is the Average Salary Range for Site Reliability Engineer II (Apache Spark, Python, AWS/GCP) Positions?

The average salary range for a varies, but the pay scale is rated "Standard" in Karnataka. Salary levels may vary depending on your industry, experience, and skills. It's essential to research and negotiate effectively. We advise reading the full job specification before proceeding with the application to understand the salary package.

What Are the Key Qualifications for Site Reliability Engineer II (Apache Spark, Python, AWS/GCP)?

Key qualifications for Site Reliability Engineer II (Apache Spark, Python, AWS/GCP) typically include Other General and a list of qualifications and expertise as mentioned in the job specification. The generic skills are mostly outlined by the . Be sure to check the specific job listing for detailed requirements and qualifications.

How Can I Improve My Chances of Getting Hired for Site Reliability Engineer II (Apache Spark, Python, AWS/GCP)?

To improve your chances of getting hired for Site Reliability Engineer II (Apache Spark, Python, AWS/GCP), consider enhancing your skills. Check your CV/Résumé Score with our free Tool. We have an in-built Resume Scoring tool that gives you the matching score for each job based on your CV/Résumé once it is uploaded. This can help you align your CV/Résumé according to the job requirements and enhance your skills if needed.

Interview Tips for Site Reliability Engineer II (Apache Spark, Python, AWS/GCP) Job Success

American Express interview tips for Site Reliability Engineer II (Apache Spark, Python, AWS/GCP)

Here are some tips to help you prepare for and ace your Site Reliability Engineer II (Apache Spark, Python, AWS/GCP) job interview:

Before the Interview:

Research: Learn about the American Express's mission, values, products, and the specific job requirements and get further information about

Other Openings

Practice: Prepare answers to common interview questions and rehearse using the STAR method (Situation, Task, Action, Result) to showcase your skills and experiences.

Dress Professionally: Choose attire appropriate for the company culture.

Prepare Questions: Show your interest by having thoughtful questions for the interviewer.

Plan Your Commute: Allow ample time to arrive on time and avoid feeling rushed.

During the Interview:

Be Punctual: Arrive on time to demonstrate professionalism and respect.

Make a Great First Impression: Greet the interviewer with a handshake, smile, and eye contact.

Confidence and Enthusiasm: Project a positive attitude and show your genuine interest in the opportunity.

Answer Thoughtfully: Listen carefully, take a moment to formulate clear and concise responses. Highlight relevant skills and experiences using the STAR method.

Ask Prepared Questions: Demonstrate curiosity and engagement with the role and company.

Follow Up: Send a thank-you email to the interviewer within 24 hours.

Additional Tips:

Be Yourself: Let your personality shine through while maintaining professionalism.

Be Honest: Don't exaggerate your skills or experience.

Be Positive: Focus on your strengths and accomplishments.

Body Language: Maintain good posture, avoid fidgeting, and make eye contact.

Turn Off Phone: Avoid distractions during the interview.

Final Thought:

To prepare for your Site Reliability Engineer II (Apache Spark, Python, AWS/GCP) interview at American Express, research the company, understand the job requirements, and practice common interview questions.

Highlight your leadership skills, achievements, and strategic thinking abilities. Be prepared to discuss your experience with HR, including your approach to meeting targets as a team player. Additionally, review the American Express's products or services and be prepared to discuss how you can contribute to their success.

By following these tips, you can increase your chances of making a positive impression and landing the job!

How to Set Up Job Alerts for Site Reliability Engineer II (Apache Spark, Python, AWS/GCP) Positions

Setting up job alerts for Site Reliability Engineer II (Apache Spark, Python, AWS/GCP) is easy with India Jobs Expertini. Simply visit our job alerts page here, enter your preferred job title and location, and choose how often you want to receive notifications. You'll get the latest job openings sent directly to your email for FREE!