Pear VC Logo

Pear VC

Founding Machine Learning Engineer

Posted 12 Days Ago
7 Locations
Mid level
7 Locations
Mid level
As a Machine Learning Engineer, you will develop and train models for data knowledge extraction, fine-tune LLMs, and collaborate across data infrastructure.
The summary above was generated by AI

Founding Machine Learning Engineer

Full-Time - Engineering - SF Bay

About Us

We are on a mission to bridge the gap between enterprise business knowledge and data, democratizing data discovery and curation to prepare organizations for the era of generative AI. Today's data tools are overly complex, poorly integrated, and siloed, forcing AI Practitioners and data scientists alike to spend more time wrestling with tools, relying on tribal knowledge, and navigating data lakes rather than doing meaningful data science work. The current landscape of data tools and processes is heavily manual and needs to catch up with the vast amount of data generated daily. With the advent of Gen AI and multi-modality, this challenge has only grown more complex and broken.

Backed by top VC funds, we are committed to making enterprise data AI-ready faster, more reliably, and with a stronger foundation of factual semantic knowledge. This leads to more accurate models, superior outcomes, and better business results. Our team of seasoned data infrastructure and machine learning experts (from LinkedIn, Visa, Truera, Hive, and Branch) has spent the past two decades building bespoke systems to solve these very challenges.

Join our growing team of ML research and data infrastructure experts. We're committed to empowering AI and data scientists to seamlessly integrate semantic learning with generative AI. Be part of our journey to shape the future of enterprise AI.

About the job

We are looking for a Machine Learning Engineer to join our team who is based in the Bay Area or willing to move. The ideal candidate should have expertise in one or more of the following areas: Knowledge Extraction, Natural Language Understanding, Unsupervised Learning, Information Retrieval, and Fine-tuning LLMs. In this role, you'll play a critical part in developing and training the models, pipelines, and methodologies that power our semantic graph systems. We're looking for someone with a strong background in machine learning, natural language processing, LLMs, and semantic technologies, with a proven track record of tackling complex, large-scale machine learning projects.

What You Will be Doing

  • Build and/or use best-in-class models to extract knowledge from heterogeneous sources

  • Develop methods to build and evaluate AI Data Graphs

  • Fine-tuning LLMs with domain-specific context 

  • Work with data infra engineers to develop the best platform for your needs


Prior Experience

  • MS degree in CS or equivalent

  • Startup experience is highly preferred

  • 3+ yrs experience in Machine Learning or Knowledge Extraction

  • 3+ yrs experience working with text

  • Experience working with and fine-tuning language models such as BERT, LLM, SLMs

  • Experience with NLP tools such as spacy, openNLP, openNER, GLiNER, etc.

  • Experience with embedding-based retrieval

  • Strong background in the fundamentals of machine learning

  • Deployed and maintained ML, NLP or LLM models in production

  • Strong data manipulation skills using tools such as numpy and pandas

  • Great communication skills and a team player

Nice to haves

  • Familiar with LLM ecosystem and best practises of fine-tuning and prompt-engineering

  • Experience working on ML and data in the cloud

  • Experience with GPU optimization

  • Experience working with docker, k8s

  • Experience working with ray,vllm

Top Skills

Docker
Fine-Tuning Llms
Gliner)
Information Retrieval
K8S
Knowledge Extraction
Natural Language Understanding
Nlp Tools (Spacy
Numpy
Openner
Opennlp
Pandas
Ray
Unsupervised Learning
Vllm

Similar Jobs

12 Days Ago
7 Locations
Senior level
Senior level
Angel or VC Firm
As a Founding Engineer, you will guide the technical direction and build GPU infrastructure for real-time AI, optimizing inference engines and engaging with customers.
Top Skills: CudaDockerFirecrackerKubernetesLlama.CppMlNomadPyTorchRayTerraformTransformersVllm
An Hour Ago
Remote
Hybrid
Toronto, ON, CAN
Mid level
Mid level
Artificial Intelligence • Hardware • Information Technology • Security • Software • Cybersecurity • Big Data Analytics
Design and develop Computer Vision and Machine Learning algorithms, write production-quality C++ code for real-time systems, and create testing tools.
Top Skills: C++Computer VisionLinuxMachine LearningOpencvPython
4 Hours Ago
Remote
Hybrid
7 Locations
Senior level
Senior level
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
The role involves designing and operating APIs for authentication, evolving system architecture, mentoring teammates, and defining product roadmaps. Candidates should have experience in software development and customer-centric design.
Top Skills: AWSDatadogDynamoDBGoGrpcHibernateHTTPJavaJSONKafkaKotlinKubernetesMySQLPrometheusProtocol Buffers

What you need to know about the Ottawa Tech Scene

The capital city of Canada and the nation's fourth-largest urban area, Ottawa has proven a rapidly growing global tech hub. With over 1,800 tech companies, many of which are leaders in their sectors, the city's tech talent now makes up more than 13 percent of its total workforce. This growth is driven not only by the big players like UL Solutions and Dropbox, but also by a thriving startup ecosystem, as new businesses emerge to follow in the footsteps of those that came before them.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account