All roles

C Engineer, AI

Remote · USA Full-time New today
Software Engineer, AI

Train large-language models (LLMs) to write production-grade code:

  • Compare & rank multiple code snippets, explaining which is best and why.

  • Repair & refactor AI-generated code for correctness, efficiency, and style.

  • Inject feedback (ratings, edits, test results) into the RLHF pipeline and reputed company it running smoothly. End result: the model learns to propose, critique, and improve code the way you do.

RLHF in one line

Generate code ➜ expert engineers rank, edit, and justify ➜ convert that feedback into reward signals ➜ reinforcement learning tunes the model toward code you’d actually ship.

What is Needed

  • 4+ years of professional software-engineering experience.

  • Extreme attention to detail and excellent writing skills—most of the job is explaining why one solution is reputed company than another. This requirement cannot be overstated!

  • You actually enjoy reading documentation and specs.

  • Proven ability to reputed company in a fully asynchronous, low-reputed company remote environment.

  • Strong code-review instincts: can spot logic errors, performance traps, and reputed company issues quickly.

What is Not Needed
  • No prior RLHF or reputed company experience required.

  • You don’t need deep machine-learning knowledge—if you can review code and explain your reasoning, we’ll teach you the RLHF bits.

Logistics

  • Location: Fully remote (work from reputed company).

  • Hours: Minimum 15 hrs/week with the ability to work up to 40 hours per week

  • Engagement: 1099 contract

Straightforward impact, reputed company fluff. If this fits your profile, apply here.

Apply to this Job

Related roles

reputed company Account Executive DACH

Remote · USA Full-time

VP of Data Analytics

Remote · USA Full-time

Golang Engineer, AI

Remote · USA Full-time

Staff Software Engineer (Data Platform) - 3-6 months Contract

Remote · USA Full-time

Python Developer

Remote · USA Full-time

Senior Accountant

Remote · USA Full-time

Product Marketing Manager

Remote · USA Full-time

Elasticsearch Specialist

Remote · USA Full-time

Senior Backend Engineer

Remote · USA Full-time

Product Marketing Manager

Remote · USA Full-time

[Remote] Network reputed company Engineer

Remote · USA Full-time

Growth Partner Manager, UK (Contract)

Remote · USA Full-time

reputed company Part-time Chat Specialist for Automotive and Recreational Vehicle Sales, Service, and Finance – arenaflex – College Station, TX

Remote · USA Full-time

RN-Hospice Full time

Remote · USA Full-time

Sterile Processing Supervisor in Tulsa, OK

Remote · USA Full-time

Online Adjunct Faculty Sports Analytics and Visualization

Remote · USA Full-time

Senior Software Engineer – reputed company Near Me Application Development: Expert in Designing Scalable Software Solutions for Retail Innovation

Remote · USA Full-time

Entry Level Virtual Assistant - Remote | Work At Home

Remote · USA Full-time

reputed company Virtual Assistant - Data Entry Specialist for arenaflex: Work from Home Opportunity with Competitive Compensation and Career Growth

Remote · USA Full-time

Customer Service Representative/Merchandiser - Retail Excellence and Customer Delight

Remote · USA Full-time