Site Reliability Engineer - Observability @ Flexport - San Francisco, CA

Job Overview

12 days ago

Site Reliability Engineer - Observability

Flexport - San Francisco, CA

The opportunity:

Flexport is looking for observability-minded Site Reliability Engineers (SREs) to help Flexport establish itself as the most trusted company in the global trade ecosystem. We support our development teams by creating safe support structures for them to build from. Some of those support structures are software systems, but the most important structure that we're building is a culture of reliability. We have opinions about how to build and deploy reliable software, and we aren't afraid to share them.

As an Observability SRE, you'll join a team of experts entrusted with defining and defending the observability standards that help our dev teams understand how their software works in practice.

You will:

  • Develop, implement, own, and advance telemetry capabilities used throughout Flexport's Technology organization as well as the infrastructure and services that collect, process, and store the data.
  • Advise engineering teams on observability topics throughout the software development lifecycle.
  • Evangelize Flexport observability capabilities across the organization, provide training, documentation, and enhancements to existing capabilities.
  • Support the resolution of action items identified during incident response retrospectives related to observability deficiencies.

You should have:

  • 3+ years of experience in a fast-paced global environment doing Site Reliability Engineering, Software Engineering, or Systems Engineering.
  • Production-level experience deploying and maintaining commercial or open-source monitoring tools.
  • Production-level experience with AWS and container-based infrastructure.
  • Proficiency in languages such as Python, Go, Java, JavaScript, or Ruby with hands-on experience debugging and optimizing code.
  • Strong interpersonal and communications skills.

About Flexport:

At Flexport, we believe global trade can move the human race forward. That's why it's our mission to make it easy and accessible for everyone. We're shaping the future of a $8.6T industry with solutions powered by innovative technology and exceptional people. Today, companies of all sizes—from emerging brands to Fortune 500s—use Flexport technology to move more than $19B of merchandise across 112 countries a year.

The recent global supply chain crisis has put Flexport center stage as we continue to play a pivotal role in how goods move around the world. At a valuation of $8B, we're experiencing record growth and are proud to have the support of the best investors in the game who believe in our mission, solutions and people. Ready to tackle global challenges that impact business, society, and the environment? Come join us.


Worried about not having any logistics experience?

Don't be! Our mission is to make global trade easy for everyone. That's why it's important to bring people from diverse backgrounds and experiences together with our industry veterans to help move the global logistics industry forward.

We know this industry is complex. That's why we invest in education starting day one with Flexport Academy, a one week intensive onboarding program designed specifically to set every new Flexport employee up for success.

At Flexport, our ability to fulfill our mission of making global trade easy for everyone relies on having a diverse, dedicated and engaged workforce. That is why Flexport is committed to creating and nurturing an environment where anyone can be their authentic self. All qualified applicants will receive consideration for employment regardless of race, color, religion, sex, national origin, age, physical and mental disability, health status, marital and family status, sexual orientation, gender identity and expression, military and veteran status, and any other characteristic protected by applicable law.

To learn more about what our tech teams have been up to, head to the Engineering Blog.

Similar Jobs

Digital Products - Site Reliability Engineer (SRE - DBaaS)

PRICE WATERHOUSE COOPERS

San Francisco, CA

Managing and continually improving platform infrastructure and applications with high reliability, resiliency, performance & quality, and faster time-to-market…

Digital Products - Site Reliability Engineer (SRE - DBaaS)

PRICE WATERHOUSE COOPERS

San Jose, CA

Managing and continually improving platform infrastructure and applications with high reliability, resiliency, performance & quality, and faster time-to-market…

Digital Products - Site Reliability Engineer (SRE - DBaaS)

PRICE WATERHOUSE COOPERS

Sacramento, CA

Managing and continually improving platform infrastructure and applications with high reliability, resiliency, performance & quality, and faster time-to-market…

Senior Principal Site Reliability Engineer

Shutterfly 2.0

Redwood City, CA

At Shutterfly, we make life’s experiences unforgettable. We believe there is extraordinary power in the self-expression. What You’ll Do Here:

Senior Principal Site Reliability Engineer

Shutterfly 2.0

San Jose, CA

At Shutterfly, we make life’s experiences unforgettable. We believe there is extraordinary power in the self-expression. What You’ll Do Here:

Senior DevOps Engineer - Juniper Apstra

Juniper Networks

Sunnyvale, CA

The successful incumbent will possess both a deep technology background and experience in DevOps/ site reliability engineering, tool selection and…

Senior Principal Site Reliability Engineer

Shutterfly

Redwood City, CA

At Shutterfly, we make life’s experiences unforgettable. We believe there is extraordinary power in the self-expression. What You’ll Do Here:

Senior Principal Site Reliability Engineer

Shutterfly

San Jose, CA

At Shutterfly, we make life’s experiences unforgettable. We believe there is extraordinary power in the self-expression. What You’ll Do Here:

Sr. Site Reliability Engineer - DevOps

Kraken Digital Asset Exchange

San Francisco, CA

As one of the largest and most trusted *digital asset platforms* globally, we are empowering people to experience the life-changing potential of crypto.

Site Reliability Engineer (DMaaS)

Cohesity

San Jose, CA

Cohesity is on a mission to radically simplify how organizations manage their data to unlock limitless value. With DMaaS our customers manage their data sources…

Principal Site Reliability Engineer

Palo Alto Networks

Santa Clara, CA

This includes automation, architecture, performance, metrics, troubleshooting, security, and reliability. At Palo Alto Networks®, everything starts and ends…

Principal Site Reliability Engineer (DLP)

Palo Alto Networks

Santa Clara, CA

Be responsible for on-going maintenance and support of internal tools, improving system health and reliability. We’re here for better.

Sr Principal Site Reliability Engineer

Palo Alto Networks

Santa Clara, CA

This includes automation, architecture, performance, metrics, troubleshooting, security, and reliability. At Palo Alto Networks®, everything starts and ends…

Site Reliability Engineer

Cisco Systems

San Jose, CA

Location (Primary): San Jose, California, US. Today’s challenging business environment is more than that – it’s a period of disruption between the pandemic,…

Site Reliability Engineer

Guardant Health

Palo Alto, CA

Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity.

Site Reliability Engineer

IBM

Mountain View, CA

Experience understanding large-scale complex systems from a reliability perspective including resolving issues and identify strategies to mitigate in future.

Principle Site Reliability Engineer (Prisma Cloud Security) - Can be remote in the US

Palo Alto Networks

Santa Clara, CA

We’re a group of software engineers, site reliability engineers, and security experts that own the deployment architecture and strive to improve our…

Principal Site Reliability Engineer (WildFire Cloud Infrastructure) - can be remote in the US

Palo Alto Networks

Santa Clara, CA

This includes automation, architecture, performance, metrics, troubleshooting, security, and reliability. At Palo Alto Networks® everything starts and ends with…

Principal Site Reliability Engineer

Accela

San Ramon, CA

Accela provides cutting edge technology for government agencies to engage and serve their citizens. The cornerstone of our technology is the Civic Cloud…

Cloud Site Reliability Engineer- Watson Orders

IBM

San Jose, CA

Watson Orders is a IBM Silicon Valley based technology development group targeting the development of world-class conversational AI.

Site Reliability Engineer

Qualys

Foster City, CA

Advice the cloud platform team to improve the reliability of the systems in production and scale them based on need. Proficient in writing bash scripts.

Senior Site Reliability Engineer - DNS Infrastructure

Roblox

San Mateo, CA

Every day, tens of millions of people from around the world come to Roblox to play, learn, work, and socialize in immersive digital experiences created by the…

Site Reliability Engineer | Production Engineer

Convex

San Francisco, CA

Help with establishing reliability guidelines and ensuring systems meet goals around durability, availability and performance.

Site Reliability Engineer - Client Support Services

Kraken Digital Asset Exchange

San Francisco, CA

As one of the largest and most trusted *digital asset platforms* globally, we are empowering people to experience the life-changing potential of crypto.