Head of Reliability - Blueflame AI Job at Datasite, New York, NY

  • Datasite
  • New York, NY

Job Description

## Head of Reliability - Blueflame AI

* Remote type: Hybrid
* Locations: USA - NY - New York City - BlueFlame AI
* Time type: Full time
* Posted on: 4 Days Ago
* Job requisition ID: R35524

Datasite and its associated businesses are the global center for facilitating economic value creation for companies across the globe, from data rooms to AI deal sourcing and more. Here you’ll find the finest technological pioneers: Datasite, Blueflame AI, Firmex, Grata, and Sherpany. Collectively, they define the future for business growth. Apply for one position or as many as you like. Talent doesn’t always go in one direction or fit in a single box. We’re happy to see whatever your superpower is and find the best place for it to flourish. Get started now; we look forward to meeting you.

**Job Description:**

Blueflame AI for Datasite is looking for a **Head of Reliability** to own reliability, quality, and release assurance across the entire Blueflame AI platform.

This is not a support role. It’s a technical leadership position that combines QA and platform reliability ownership to ensure that every feature shipped is tested, stable, and trustworthy.

You’ll manage the reliability roadmap, set quality standards, and work closely with our engineering and product teams to make reliability a priority in everything we build.

**Key Responsibilities**

**Quality Assurance (QA) Ownership**

* Lead the QA function, defining frameworks, tooling, and processes for automated and manual testing.
* Ensure every release meets strict reliability and data integrity standards.
* Work with engineering to build and maintain CI/CD-integrated test automation for frontend, backend, and model workflows.
* Partner with product managers to define acceptance criteria, regression suites, and go/no-go release thresholds.

**Reliability & Platform Resilience**

* Define and own Blueflame’s reliability strategy: uptime, latency, and system integrity across core services (API, search, context engine, data integrations).
* Establish and manage SLOs/SLIs with engineering squads, ensuring proactive monitoring and error budgeting.
* Review architectural designs for resilience, scalability, and recoverability.
* Implement and manage monitoring and alerting across our platform, including within AWS. Oversee the observability stack and monitoring pipelines (logs, metrics, traces, dashboards).
* Establish real-time performance insights and alerting mechanisms.

**Release Assurance & Continuous Improvement**

* Implement consistent release and rollback processes across environments.
* Manage release readiness reviews and reliability audits.
* Work with the support team on post-incident reviews and the implementation of long-term fixes.

**Leadership & Culture**

* Build and lead a small, high-impact reliability engineering and QA team.
* Champion quality-by-design principles within all engineering squads.
* Assist with SOC-2 readiness.

**Requirements**

* 8+ years in reliability, QA, or platform engineering roles, including 1+ years in a management role.
* Strong experience designing and running QA and automated testing frameworks within CI/CD pipelines.
* Hands-on experience with AWS cloud infrastructure and observability tools, including Datadog and the ELK stack.
* Familiarity with LLM or AI-driven systems a plus (especially testing non-deterministic or probabilistic outputs).
* Track record of improving uptime, release quality, and user trust in production environments.
* Excellent collaboration skills, with the ability to work across Product, Engineering, and Security functions.

The base salary range represents the estimated low and high end for this position based on a good faith assessment of the role and market data at the time of posting. Consistent with applicable law, each candidate’s compensation offer may vary and will be determined based on, but not limited to, your geographic region, skills, qualifications, and experience, along with the requirements of the position. This position may be eligible for bonuses, commissions, or overtime if applicable. Benefits include health insurance (medical, dental, vision), a retirement savings plan, paid time off, and other employee benefits. Specific details will be provided during the interview process. Datasite reserves the right to modify this pay range at any time.

$141,000.00 - $248,000.00

Our company is committed to fostering a diverse and inclusive workforce where all individuals are respected and valued. We are an equal opportunity employer and make all employment decisions without regard to race, color, religion, sex, gender identity, sexual orientation, age, national origin, disability, protected veteran status, or any other protected characteristic. We encourage applications from candidates of all backgrounds and are dedicated to building teams that reflect the diversity of our communities.

Job Tags

Full time
