Audio Inference Engineer, Model Efficiency
Cohere
Who are we? Cohere is the leading security-first enterprise AI company. We build cutting-edge foundation AI models and end-to-end products that are designed to solve real-world business problems. We're training and deploying frontier models for enterprises who are building AI systems. We believe that our work is instrumental to the widespread adoption of AI and we are looking for folks that want to be part of that. We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. Cohere is a team of researchers, engineers, designers, and more, who are all passionate about their craft. We are a global technology company co-headquartered in Toronto and San Francisco, with key offices in London, New York City, Montreal, Seoul, Germany and Paris. Join us! Why this role? Our team is a fast-growing group of committed researchers and engineers. The mission of the team is to build reliable machine learning systems and optimize audio inference serving efficiency using innovative techniques. As an engineer on this team, you will work on advancing core audio model serving metrics, including latency, throughput, and quality by diving deep into our systems, identifying bottlenecks, and delivering creative solutions for audio processing and streaming workloads. You'll collaborate closely with both the training and serving infrastructure teams to ensure seamless integration between model development and deployment, with a special focus on real-time and streaming audio inference. Please Note: We have offices in Toronto, Montreal, San Francisco, New York, Paris, Seoul and London. We embrace a remote-friendly environment, and as part of this approach, we strategically distribute teams based on interests, expertise, and time zones to promote collaboration and flexibility. You'll find the Model Efficiency team concentrated in the EST and PST time zones, these are our preferred locations. You may be a good fit for the team if you have:
We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.
We may use AI-enabled tools to screen and assess applicants against the criteria for this position. This helps our recruiters identify potentially qualified candidates, but it doesn't limit the applications our recruiters may review or consider. Full-Time Employees at Cohere Enjoy These Perks:
- Significant experience developing high-performance audio or machine learning inference systems.
- Proficiency with programming languages such as C++ and Python.
- Hands-on experience with deep learning models for audio, speech, or language applications.
- A bias for action and a strong results-oriented mindset.
- GPU programming, low-level system optimization, model parallelization techniques over multiple GPUs
- Have experience with duplex real-time streaming architectures.
- Internals of machine learning frameworks for audio (such as PyTorch, TensorFlow, or specialized audio libraries).
- Have experience with inference framework like vLLM, SGLang, Tensort-LLM, or custom distributed inference systems
- Sequence modeling (e.g., transformers for audio/speech) and end-to-end audio pipeline optimization
We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.
We may use AI-enabled tools to screen and assess applicants against the criteria for this position. This helps our recruiters identify potentially qualified candidates, but it doesn't limit the applications our recruiters may review or consider. Full-Time Employees at Cohere Enjoy These Perks:
- An open and inclusive culture and work environment
- Work closely with a team on the cutting edge of AI research
- A weekly lunch stipend of $75/£75 or equivalent in your local currency for lunch.
- Full health and dental benefits, including a separate budget for mental health.
- RRSP matching, 401K, Pension Scheme.
- 100% Parental Leave top-up for up to 6 months, for either parent.
- Annual enrichment benefits:
- Arts & culture, fitness/wellness, quality time, and a workspace improvement credit.
- Education & learning stipend for conferences, courses, and coaching.
- Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
- 6 weeks of paid vacation (30 working days!)
- Budget for traveling to other offices if you are remote, plus an annual company offsite.
- Cohere is remote-friendly. We have offices in Toronto, San Francisco, New York City, London, Paris, Montreal, and more coming soon.
- For those in the office: a daily lunch program, plenty of snacks, and regular community and social events.
- For those not near an office: a co-working benefit so you can work alongside others in your city.
- Everyone receives a $500 home office stipend to set up your workspace properly.
Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Audio Inference Engineer, Model Efficiency in New York, NY vacancy
- Cohere is seeking an engineering professional in New York to develop and optimize audio machine learning systems. You will work with... ...teams to improve audio model metrics, addressing latency and... ...while ensuring real-time audio inference integration. The ideal candidate...SuggestedRemote job
- ...Job We are looking for an experienced AI Model Engineer with deep expertise in kernel... ...acceleration. The engineer will extend the inference framework to support inference and fine... ...advanced quantization techniques to improve efficiency and memory usage. Debug and optimize...SuggestedRemote job
- ...impact by delivering high-quality model serving infrastructure and... ...empower data scientists, engineers, and business stakeholders alike... ...spanning model deployment, inference infrastructure, and lifecycle... ...model availability, and cost efficiency. Lead end‑to‑end product delivery...Suggested
$136.8k - $292.6k
...You will be deploying your engineering, data analytics and data science... ...in frameworks for auditing models, including criteria like... ...different business verticals to efficiently support audit engagements;... ...methods, data pipelines, and inference behaviors. - Capability to...SuggestedTemporary workLocal area$16.5 - $18 per hour
...marketing, and operations. This role supports audio production and event operations at... ...qualified persons available in a timely and efficient manner. Live Nation may pursue all... ...Employment type: Part-time Job function: Engineering and Information Technology Industries:...SuggestedHourly payPart timeLocal area- Overview Audio Engineer - Pro Tools (AI Training) We're looking for experienced Audio Engineers... ...and industry standards Work efficiently in Pro Tools to edit, balance, and enhance... ...dataset creation, labeling, or AI audio model training projects Familiarity with machine...Hourly payContract workRemote work
- ...Function We are searching for a versatile and highly skilled Audio Engineer (A1) to join our dynamic production team! This is a... ...setting up, configuring, and breaking down complex audio systems efficiently and safely. Live Mixing & Operation: Operating professional...Part timeFreelanceLocal areaNight shift
$170k - $240k
...York is seeking a skilled machine learning engineer to develop innovative solutions for a new... ...learning and a strong background in audio modeling, ensuring successful product deployment. The role focuses on optimizing inference pipelines and collaborating with a cross-...- ...Senior Consultant – Model Risk Governance Arrayo is seeking an experienced Model... ...opportunities to strengthen controls, improve efficiency, and enhance transparency across the... ...in Finance, Economics, Mathematics, Engineering, Business Administration, Risk Management...
- ...Direct message the job poster from EDGE Engineering and Science, LLC Recruitment Director @... ...We are seeking a Senior Air Dispersion Modeler to lead and manage complex air quality modeling... ...EDGE Engineering and Science, LLC by 2x Inferred from the description for this job...Full timeFor contractorsLocal areaRemote work
$90k - $100k
...Description: BASIC FUNCTION: Assists the Divisional Director of Residential Services in operating the program effectively and efficiently in compliance with the policies and procedures of LESC, OASAS regulations, CARF standards, and other oversight agencies. Provides...Full timeLive outMonday to Friday- ...Job Title Analyst Or Associate Level Of Credit Risk Model Owner Job Description An international banking organization seeks... ...governance processes and model documentation standards to improve efficiency and accuracy of model validation process Interpersonal...Work experience placementWork at officeLocal area
- ...research lab dedicated to building foundation models for environments that require deep... ...systems, with an understanding of compute efficiency, distributed training, and data... ...gaming AI. Experience with Unity, Unreal Engine, or custom simulators is a plus. Our...Work at office
- ..., and emotional well-being. DTCC offers a flexible/hybrid model of 3 days onsite and 2 days remote (onsite Tuesdays, Wednesdays... ...risk, increasing transparency, enhancing performance and driving efficiency for thousands of broker/dealers, custodian banks and asset...Remote workFlexible hours
$120k - $205k
...business units across the Firm to realize efficient risk-adjusted returns, acting as a... ...credit, market, liquidity, operational, model and other risks. Background on the Position... ...workflows Apply software engineering techniques to build efficient, reliable,...Temporary workWork at office3 days per week- ...FOR INTERNS WHO WORK WITH RECORDING ARTISTS, FILM/TV/LIVE SOUND ENGINEERS ARE NOT IDEAL CANDIDATES.* Company Description Invite... ...is intended for candidates seeking to advance their careers as Audio Engineers, particularly those with regular experience working with...Summer workInternship
- Title Senior Associate - Quantitative Liquidity & Market Risk (Model Development & Analytics) Office Status Hybrid - New York, NY... ...optimization of model workflows and risk analytics processes to improve efficiency and scalability Produce model‑based risk reporting and...Work at office
- Job Title: B2B Strategy & Operating Model Lead Location: 100% Remote Contract: 04+ months Description: We’re looking for a senior commercial... ...workflows Quantify risks and opportunities across revenue, efficiency, and partner experience Design a shared B2B segmentation model...Contract workRemote work
$137.5k - $157.5k
...churn and billions in premiums flowing through hx. About the Model Development team The Model Development team operates at the intersection... ...measurable improvements in pricing accuracy and operational efficiency. Apply a data‑driven approach to prioritize project work,...Work at officeRemote workFlexible hoursNight shift$216k - $270k
...As a Software Engineer on the ML Infrastructure team, you will design... ...for scalable, reliable, and efficient serving of LLMs. Our platform... ...to integrate and optimize models for production and research use... ...TensorRT-LLM, or text-generation-inference. Compensation packages...Full time$163.6k - $225k
...Senior Data Scientist focusing on AI & Model Risk, you will lead and coordinate AI risk... ...with senior stakeholders across Risk, Engineering, Legal, Compliance, and Information Security... ...tools to evaluate job applications for efficiency and consistency. These tools comply with...Work experience placementWork at officeLocal areaRemote workFlexible hours- ...consisting of a variety of LLM, speech, and vision models. Partner with ML infrastructure and training engineers to build a fast, cost-effective, accurate, and... ..., caching, and custom kernels to speed up inference. Find ways to reduce model initialization times...Full timeContract workFlexible hours
- A dynamic production company is seeking a skilled freelance Audio Engineer (A1) to manage technical aspects of audio production across various environments. Responsibilities include ensuring high-quality sound for live concerts, broadcasts, and corporate events while collaborating...FreelanceFlexible hours
- Overview Handshake is recruiting professionals with experience in technical tools such as Audacity, Ardour, Linux Multi‑Media Studio and more to contribute to an AI research project. In this program, you’ll apply your expertise to help improve how AI understands real‑world...Work experience placementRemote workFlexible hours
- Handshake is seeking experienced professionals to contribute to an AI research project, working remotely and asynchronously. You will apply your expertise in tools like Audacity, Ardour, and Linux Multi‑Media Studio to improve AI's understanding of professional tasks. The...Remote jobFlexible hours
- Advanced Systems Group, LLC is seeking an Audio Engineer to support the audio needs of studio events, ensuring excellent sound quality during both streaming and live experiences. The ideal candidate must possess proficiency in operating Audio Mixing tools and have strong...
- ...operating equipment, troubleshooting systems, and participating in production meetings. Candidates should have at least 3 years of audio experience and strong communication skills. Physical requirements include lifting up to 50 lbs and the ability to walk 5 miles as needed...
$65 per hour
Create, edit, and mix multi-track audio projects using Ardour or LMMS. Demonstrate end-to-end audio production workflows from concept to final mix. Experiment with sound design, layering, and real-time adjustments during production. Apply professional mixing techniques...Remote job$46 - $50 per hour
...partners with organizations to improve efficiency, transform operations, and drive business... ...technology, and data insight. Job Title: Model Risk Management (MRM) Product Lead... ...plans. Partner with business, risk, engineering, and data teams to align features with regulatory...Temporary work$89.25k - $150.25k
...Manager, Reliance Model Program Governance & Privacy Champion New York, NY, United States(Hybrid) Job Description Enterprise... ...delivering best-in-class services that power safe, resilient, and efficient operations around the world. The Corporate Functions...Full timeWork at officeLocal areaFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Audio Inference Engineer, Model Efficiency. Be the first to apply!
Related searches


