
Solutions Engineer — Media Generation

Remote · USA · Full-time

About Artificial Analysis

Artificial Analysis is the leading independent AI benchmarking company. We help labs, engineers, and enterprises understand AI capabilities and make critical decisions about their AI strategies. We are the go-to authority for understanding AI, from AI labs and enterprises to media, investors, and policymakers. Our benchmarks don't just measure the cutting edge of AI; they are actively shaping the frontier.

Our benchmarks and analysis are trusted by hundreds of thousands of users and are the go-to reference for leading AI labs including OpenAI, Google, Meta, NVIDIA, and Anthropic, and major publications including the Wall Street Journal, Bloomberg, the Financial Times, and The Economist.

We are a team of 35+, on track to triple by year end, backed by Nat Friedman (GitHub, Meta), Daniel Gross (SSI), Andrew Ng (Google Brain, DeepLearning.AI, Amazon), Adam D'Angelo (Quora, Poe, OpenAI), Clem Delangue (Hugging Face), and other industry leaders.

The Opportunity

Artificial Analysis benchmarks leading image and video generation models, providing the AI industry with independent quality and performance comparisons. Our media generation benchmarks rely on structured human preference evaluations to assess output quality across models.

We're hiring a Solutions Engineer to manage our media generation benchmarking pipeline. You'll run image and video generation evaluations, manage human preference studies, and serve as a technical point of contact for media generation model providers. This is a process-driven, operational role suited to someone who is detail-oriented, comfortable with Python, and able to manage pipelines reliably day-to-day.

What You'll Do

- Generate image and video outputs across models according to standardized evaluation protocols
- Set up and manage human preference evaluation studies, including study design, participant management, and quality control
- Process and analyze preference vote data to produce benchmark results
- Manage the end-to-end pipeline, from prompt execution through to published results
- Serve as a technical point of contact for media generation model providers: communicating results, explaining methodology, and handling queries
- Monitor data quality, flag anomalies, and ensure consistency across evaluation rounds
- Maintain documentation of processes and configurations
- Stay current with new image and video model releases

What We're Looking For

Required:

- 3+ years of experience in a technical operations, data operations, or solutions engineering role
- Comfortable with Python scripting and working with APIs
- Experience managing research studies, data collection pipelines, or crowdsourcing platforms is a strong plus
- Detail-oriented with strong process management skills; you can run recurring workflows reliably without oversight
- Good written and verbal English communication skills
- Responsive, organized, and dependable

Nice to have (not required):

- Experience with image or video generation models (Midjourney, DALL-E, Stable Diffusion, Runway, Sora, etc.)
- Background in data analysis or research operations
- Familiarity with human evaluation methodologies or preference-based ranking systems
- Experience in B2B SaaS or developer tools

Why Artificial Analysis?

- Shape how AI gets built: The leading AI labs track our benchmarks and use them to guide their development priorities. Your work will directly influence the direction of AI.
- Become a world expert in AI: You will evaluate every major model, across every major capability, as they are released. Very few roles offer this breadth of exposure to frontier AI.
- Work with the most important players in AI: You'll manage relationships with teams at the leading AI labs and major enterprises as a trusted, independent voice.
- Join at a defining moment: We're 35+ people and fast growing, backed by some of the most connected investors in AI. The people who join now will shape the product, the team, and the strategy as we scale.
- Competitive compensation including equity
- Our team is split across San Francisco, Sydney, and Melbourne
