[Remote] Software Engineer, Inference - Multi Modal

Remote Full-time
Note: This is a remote position open to candidates in the USA.

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. They are looking for a software engineer to help serve OpenAI’s multimodal models at scale, focusing on building reliable infrastructure for real-time audio and image processing.

Responsibilities
• Design and implement inference infrastructure for large-scale multimodal models
• Optimize systems for high-throughput, low-latency delivery of image and audio inputs and outputs
• Enable experimental research workflows to transition into reliable production services
• Collaborate closely with researchers, infra teams, and product engineers to deploy state-of-the-art capabilities
• Contribute to system-level improvements including GPU utilization, tensor parallelism, and hardware abstraction layers

Skills
• Experience building and scaling inference systems for LLMs or multimodal models
• Experience with GPU-based ML workloads and an understanding of the performance dynamics of large models, especially with complex data like images or audio
• Enjoyment of experimental, fast-evolving work and close collaboration with research
• Comfort with systems that span networking, distributed compute, and high-throughput data handling
• Familiarity with inference tooling like vLLM, TensorRT-LLM, or custom model parallel systems
• Ownership of problems end-to-end and enthusiasm for operating in ambiguous, fast-moving spaces
• Experience working with image generation or audio synthesis models in production
• Exposure to distributed ML training or system-efficient model design

Company Overview
• OpenAI is an AI research and deployment company that develops advanced AI models, including ChatGPT. It is a sub-organization of OpenAI Foundation. It was founded in 2015 and is headquartered in San Francisco, California, USA, with a workforce of 201-500 employees.

Company H1B Sponsorship
• OpenAI has a track record of offering H1B sponsorships, with 1 in 2025, 1 in 2024, 1 in 2023, 18 in 2022, 10 in 2021, and 6 in 2020. Please note that this does not guarantee sponsorship for this specific role.

Apply to this job
Apply Now
