Featherless AI

Senior Software Engineer - API Gateway

Posted 20 Days Ago
Remote
Hiring Remotely in Canada
Senior level
Develop and enhance the API gateway for an AI inference platform, focusing on feature implementation, bug fixes, infrastructure management, and reliability improvements.
The summary above was generated by AI

About the Role

Featherless.ai is building the world’s most reliable and comprehensive open-model inference platform — the infrastructure powering the next generation of AI creators, startups, and enterprises. Our serverless approach to inference unlocks the best GPU utilization in AI infrastructure.

We’re hiring Senior Software Engineers to support and evolve the API gateway to our inference cloud, which is responsible for

  • authentication and inference access for all models

  • subscription management and subscription entitlement (e.g. context-length, concurrency limits)

  • and providing the necessary API surface for applications and builders

The API gateway is constantly evolving in response to the unending stream of new models, modalities, clients, and inference load.

What you'll do

The API gateway is managed by the Platform Team, which aims to make Featherless the best place to find and use models. As a member of the Platform Team, you will

  • undertake feature development and bug fixes to keep up with clients, resolve user issues, and onboard new models

  • improve the reliability of the existing API (increasing instrumentation and monitoring, right-sizing infrastructure)

  • respond to availability incidents

  • triage and resolve issues of inference quality and reliability

  • manage the infrastructure on which our gateway runs

What you'll bring

  • first-hand experience of the users we’re building for (familiarity with popular open LLMs, common clients, and experience building with LLMs)

  • experience with the technologies and paradigms of the web (REST, WebSockets, DNS, networking, OpenTelemetry)

  • experience with significant components of our stack (k8s, node, mikro-orm, fastify, redis, mongodb, python, elastic cloud, cloudflare, sentry, otel)

  • ability to debug complex issues across a wide stack and build instrumentation as necessary

  • desire to work collaboratively as part of a skilled team

  • alignment with team and company values, including

    • bias to action

    • responsiveness to users (bug-fixes over features)

    • instinct to iterate

    • subscribing to the idea that "done" means proven by usage data

Other

This team operates on Eastern Time. We are remote, with a preference for hiring in Toronto, Canada.

Top Skills

Cloudflare
DNS
Elastic Cloud
Fastify
K8s
MikroORM
MongoDB
Networking
Node.js
OpenTelemetry
OTel
Python
Redis
REST
Sentry
WebSockets
