Senior C++ Engineer (Inference Optimization)
December 31, 2024
Open
Open
Location
Anywhere
Occupation
Full-time
Experience level
Senior
Senior C++ Engineer (Inference Optimization)
Responsibilities
- Lead inference development efforts
- Contribute towards Cortex.cpp (github), a multi engine C++, Inference Server for developers
- Contribute towards important open source dependencies like llama.cpp, ONNXRuntime, OpenVINO
Architecture and Planning
- Break down ambiguous goals and high level goals into a well architected, technical execution plan.
- Implement cost-effective solutions, innovates with rapid spikes, avoids “penny wise, pound foolish” trade offs.
Requirements
- Proven experience in C++ development.
- Experience with building a C++ application from the ground up
- Solid understanding of gRPC and its applications in web server optimization.
- Experience in high-performance computing, particularly in hardware optimization and embedded systems.
- Minimum of 3 years of professional work experience in a similar role.
- Self-starter, entrepreneurial mindset, and ability to work independently.
Benefits
- We pay an “all-in” pay and you will cover your own insurance/medical from the amount
- 14 days leave (and unlimited sick days)
- Annual equipment budget (once 2 month probation has been completed)
* Please submit only 1 application, as you will be considered across roles. Duplicate submissions will be automatically archived.
About Homebrew:
Homebrew is an AI R&D studio. We work in the broad area of Local AI, Small Language Models and Multi-modality.
We are the creators and lead maintainers of:
- 👋 jan.ai: Personal AI (2 million+ downloads)
- 🤖 cortex.so: Self-hosted AI Platform
- 🍓 Ichigo-llama3: Native speech model
We are a remote first and build in public company. Read more about our company's culture and principles here. Follow us on Linkedin and X.
Homebrew Research
Homebrew is an AI R&D studio that works in the area of Local AI, Small Language Models and Multi-modality. Homebrew is a local AI company. We're the creators and lead maintainers of Jan, a local AI assistant, and Cortex, a local AI engine. We're building products and training AI publicly to solve real AI problems. Products - 👋 Jan: Local AI Assistant (>1.6 million downloads) - 🤖 Cortex: Local AI Engine We train open-source models: - 🍓 Ichigo: Local Real-Time Voice AI AI comes with hardware, optimization, and efficiency problems at a high level. Solving these problems is about experimenting with AI in these levels. At Homebrew, we're doing this. Jan & Cortex help us to experiment with software and hardware. Ichigo helps us to do the same on the training level. Plus, we tinker with hardware: - ⛩️ Xanadu: GPU Cluster (Coming Soon) - 💡 Menlo: AI Hardware (Coming Soon) We're a fully remote team. See open roles: https://homebrew.bamboohr.com/careers We're always looking for AI enthusiasts and tinkerers who want to join us: hello@homebrew.ltd
HQ Location
Singapore, Singapore
Company type
Scale-up
Domain
Information Technology & Services