Site Reliability Engineer (SRE) Job at PayPay, Remote

cWVjOFg3TlZXZmtqU3Era0R6U2M0YVRuUXc9PQ==
  • PayPay
  • Remote

Job Description

About PayPay

PayPay is a FinTech company that has grown to over 69M (as of May 2025) users since its launch in 2018. Our team is hugely diverse with members from over 50 different countries.

OUR VISION IS UNLIMITED_

We dare to believe that we do not need a clear vision to create a future beyond our imagination. PayPay will always stay true to our roots and realize a vision (future) that no one else can imagine by constantly taking risks and challenging ourselves. With this mindset, you will be presented with new and exciting opportunities on a daily basis and have the opportunity to grow and reach new dimensions that you could never have imagined. We are looking for people who can embrace this challenge, refresh the product at breakneck speed and promote PayPay with professionalism and passion.
※ Please note that you cannot apply or be selected in parallel with PayPay Corporation, PayPay Card Corporation and PayPay Securities Corporation.

Job Description

At PayPay, we’re constantly working on improving our systems and processes to support PayPay’s exponential growth. As an SRE at PayPay, we strive towards ensuring high availability and top-level performance so that our users can have flawless and reliable service exceeding expectations.
Considering PayPay’s growth, we are looking for experienced SREs who can deliver insights into system bottlenecks and ensure system reliability and scalability, while increasing the number of services that our company offers.
We are looking for individuals who can bring informed and unique viewpoints, enjoy collaborating with a cross-functional team and are actively pushing boundaries to develop reliable and scalable solutions and positive user experiences.

Key Responsibilities

  • Analyze current technologies used in the company and develop monitoring and notification tools to improve observability and visibility.
  • Ensure system stability by pre-emptively verifying failure scenarios and implement solutions to reduce MTTR
  • Develop solutions to improve system performance with a focus on high availability, scalability and resilience
  • Integrate telemetry and alerting platforms to track and improve reliability of systems
  • Implement industry best practices for system development, configuration management and system deployment
  • Ensure seamless flow of information between teams by documenting knowledge gained
  • Be up to date on modern technologies and trends to advocate for inclusion within products if they add value
  • Participate in incident management including troubleshooting production issues, driving root cause analysis (RCA) and actively sharing lessons learned to improve system reliability and internal knowledge.

Qualifications

  • Experience troubleshooting, tuning high performance microservice architectures running on Kubernetes and AWS in highly available production environments.
  • 5+ years experience in software development in Python, Java, Go, etc with strong fundamentals in data structures, algorithms, problem solving and complexity analysis.
    *During the selection process, you will have a coding challenge.
  • Curious and proactive in finding performance bottlenecks, scalability and resilience problem areas and addressing them.
  • Experience with observability tools and gathering data.
  • Database knowledge such as RDS, NoSQL, distributed TiDB, etc.
  • Excellent communication skills, collaborative and getting things done attitude.
  • Enjoy taking up a challenge and driving it to conclusion.

Preferred Qualifications

  • Container image management and optimization.
  • Experience in large distributed system architecture and capacity planning.
  • Understanding of IaC, automation tools, terraform, cloud formation, etc.
  • Background in SRE/DevOps concepts and implementation.
  • Experience in managing monitoring tools like CloudWatch, VictoriaMetrics, Prometheus and reporting with Snowflake and Sigma.
  • In depth knowledge of web technologies such as CloudFront, Nginx, etc.
  • Experience in designing, implementing or maintaining disaster recovery strategies and multi-region architecture to ensure high availability, resilience, and business continuity across critical systems.
  • Language ability in Japanese is a plus.

PayPay 5 senses

  • Please refer  PayPay 5 senses  to learn what we value at work.

Working Conditions 

Employment Status

  • Full Time

Office Location

  • Hybrid Workstyle (flexible working style including Remote and office)
    ※There are no fixed rules regarding office attendance in Product group; it depends on each individual's discretion.

Work Hours

  • Super Flex Time (No Core Time)
  • In principle, 9:00am-5:45pm + 1h break (actual working hours: 7h45m + 1h break)

Holidays

  • Every Sat/Sun/National holidays (In Japan)/New Year's break/Company-designated Special days

Paid leave

  • Annual leave (up to 14 days in the first year, granted proportionally according to the month of employment. Can be used from the date of hire)
  • Personal leave (5 days each year, granted proportionally according to the month of employment)
    *PayPay's own special paid leave system, which can be used to attend to illnesses, injuries, hospital visits, etc., of the employee, family members, pets, etc.

Salary

  • Annual salary paid in 12 installments (monthly)
  • Based on skills, experience, and abilities
  • Reviewed once a year
  • Special Incentive once a year *Based on company performance and individual contribution and evaluation
  • Late overtime allowance

※Payroll payment can be changed to digital salary payment “PayPay Paycheck” for an amount set by you

Benefits

  • Social Insurance (health insurance, employee pension, employment insurance and compensation insurance)
  • 401K
  • Translation/Interpretation support
  • VISA sponsor + Relocation support

Other Information:

Job Tags

Remote job, Full time, Work at office, Visa sponsorship, Relocation package, Flexible hours

Similar Jobs

NPO USA

Senior Network Security Engineer Job at NPO USA

 ...remote with occasional travel to USA, and in Canada. Role Description: We are looking for a highly qualified Senior Network Security Engineer to join our Network & Security Business Unit. The professional will be responsible for the design, implementation,... 

Carrier World

Salesforce Developer Job at Carrier World

 ...Job Title - Salesforce Developer Preferred Location - Bangalore/Hyderabad, India Full time/Part Time - Full Time Build a career with confidence Carrier Global Corporation, global leader in intelligent climate and energy solutions is committed to creating... 

Mayo Clinic

Palliative Medicine Physician Job at Mayo Clinic

 ...more specialties than any other care provider according to U.S....  ...Responsibilities Join the growing Palliative Medicine Program at Mayo...  ...palliative medicine physicians from multiple disciplines and...  ...ACGME-accredited fellowship in Hospice and Palliative Medicine and there... 

110 Rehill Ave

Adjunct Therapist (Art Therapy) - Eating Disorders Outpatient - Somerville, NJ Job at 110 Rehill Ave

 ...will focus on mental health rehabilitation through the creative arts. Qualifications: Required: Bachelor's Degree required...  ...institution and be related to mental health rehabilitation/ Creative Arts Therapy, including but not limited to Art Therapy, Music Therapy, Dance/... 

Priority Delivery Solutions

Delivery Route Driver Helper - Full Time Job at Priority Delivery Solutions

 ...previous experience required. Priority Express at 316 Engineers Drive, Williston, VT, is hiring full time route helpers to assist our drivers in delivering home furnishings and other heavy/bulky items in our company vehicles to residential customers in Central and Northern...