AI Engineer - LLM Architect Job at Musing Ai, Pittsburgh, PA

T29jR2VWN3Azbi9uWGFFaFNVVEhEM2xCZ0E9PQ==
  • Musing Ai
  • Pittsburgh, PA

Job Description


AI Engineer (LLM Architect), Emotional Companion

 

The Role:

This position will help design and ship an emotionally intelligent conversational companion that reduces loneliness and improves daily life for older adults. You will architect the end-to-end AI stack, move fast with real users, and set the technical bar for the team.

What you will do :

  • Architecture : Design the conversational system from intake to response. Own policy, generation, tool use, long-term memory, personalization, and retrieval.
  • Model selection and training : Choose base models, build data pipelines, and run instruction tuning, safety tuning, and preference optimization. Use techniques such as LoRA, DPO, distillation, and quantization to reach latency and cost targets.
  • Prompt and agent design : Create robust system prompts, function-calling schemas, and tool APIs. Stand up an A/B framework to test prompts, policies, and safety rules with real users.
  • Evaluation : Build an automated and human-in-the-loop eval harness for empathy, helpfulness, safety, groundedness, latency, and cost. Define success metrics and wire them into dashboards.
  • Safety and ethics : Implement guardrails for prompt injection, jailbreaks, self-harm, medical boundaries, and misinformation. Add escalation, deflection, and human handoff paths that respect user consent.
  • Data and privacy : Set standards for PII handling, redaction, consent management, anonymization, and secure storage. Curate, generate, and label data that reflects diverse seniors and scenarios.
  • Serving and MLOps : Ship models to production using efficient inference stacks. Add observability, tracing, rollback, canary releases, and a model registry. Keep the system fast, stable, and affordable.
  • Voice pipeline : Integrate ASR, TTS with expressive prosody, barge-in, turn-taking, and latency budgets for a natural feel.
  • Collaboration : Work with design and research to translate user studies into product requirements. Mentor teammates and help make pragmatic build-vs-buy decisions.

Required skills & experience :

  • Deep Python : Production-grade code, profiling, testing, and packaging.
  • LLM implementation : Strong PyTorch and experience training or fine-tuning open models (e.g., Llama, Mistral, Qwen) including tokenizer issues, data curation, and distributed training with FSDP or DeepSpeed.
  • Inference and optimization : Quantization (GGUF, GPTQ, AWQ), serving stacks (vLLM, TensorRT-LLM, llama.cpp), caching, KV-reuse, streaming, and throughput tuning.
  • Prompt engineering and tool use : System and developer prompts, function calling, tool orchestration, and failure handling. Ability to make prompts measurable and testable.
  • Retrieval-augmented generation : Indexing, chunking, reranking, and grounding. Experience with FAISS, Milvus, Vespa, or Pinecone. Understanding of hallucination mitigation.
  • Evaluation and experimentation : Human ratings at scale, rubric design for empathy and safety, statistical testing, online A/B. Comfort turning qualitative findings into quantitative KPIs.
  • Security and privacy : PII handling, threat modeling for LLMs, prompt-level defenses, rate limiting, abuse detection. Familiarity with HIPAA-adjacent expectations and SOC 2 practices.
  • Product mindset : Ability to ship thin slices, instrument them, and iterate quickly based on user feedback.

Nice-to-have :

  • Affective computing : Emotion and intent classifiers, prosody features, conversation state tracking, de-escalation strategies.
  • Speech : ASR, diarization, VAD, latency-aware pipelines, expressive TTS.
  • Reinforcement and preference learning : DPO, PPO, ORPO, reward modeling, red-teaming loops.
  • On-device and edge : GPU and CPU constraints, memory mapping, mixed precision, mobile or embedded deployment.
  • Compliance awareness : Experience in healthcare or aging tech, consent UX, accessibility standards.
  • HCI and conversation design : Persona, turn-taking, long-term rapport, and evaluation methods suited for vulnerable users.

What success looks like in 90 days :

  • A production-ready conversational MVP with safety guardrails and memory that passes internal red-team checks.
  • An eval harness with live dashboards for empathy, safety, groundedness, latency, and cost per session.
  • A prompt and policy library with A/B tests running weekly and clear learnings.
  • A data pipeline with redaction, consent flags, and a high-quality instruction-tuning set sourced from real use.

Tools you might use:

 

Python, PyTorch, vLLM or TensorRT-LLM, llama.cpp, Weights & Biases, Ray, FAISS or Milvus, Redis, Postgres, Kubeflow or Flyte, Grafana or OpenTelemetry, Whisper or similar ASR, high-quality TTS, and standard MLOps tooling.

 

About us:

We are an exciting, new (funded) and stealthy AI startup that focuses on addressing the negative effects of isolation. You will be working with a group of experienced tech entrepreneurs and AI technologists. This position will help design and ship an emotionally intelligent conversational companion that reduces loneliness and improves daily life for older adults. You will architect the end-to-end AI stack, move fast with real users, and set the technical bar for the team.

 

What we offer:

  • Competitive base salary
  • Cash bonus 
  • Equity stack 
  • Unlimited PTO Plan
  • Dental, Vision, and Health Insurance
  • Hybrid Work Schedule in Pittsburgh, PA
  • We sponsor OPT and STEM OPT only

Job Tags

Full time,

Similar Jobs

Walt Disney Animation Studios

Walt Disney Animation Studios Character Technical Director Intern, Summer 2026 Job at Walt Disney Animation Studios

 ...how your skills and talents could translate into the world of animation? Our Internship Program dives into the craft, technology, and operations...  ...team and how your unique talents can grow within our studio. Learn more about our filmmaking process here . Find yourself... 

Learning Arts

BCBA Job at Learning Arts

 ...is a hybrid position that blends the best of in-clinic, in-home, and work-from-home opportunitiesgiving you flexibility while maximizing...  ...Required Skills current Board-Certified Behavior Analyst (BCBA) Certification Master's degree in Applied Behavior Analysis... 

Sanford Health

LPN - Neurology - PRN Job at Sanford Health

 ...Nursing Featured: No By applying, you consent to your information being transmitted by College Recruiter to the Employer, as data controller, through the Employers data processor SonicJobs. See Sanford Health Terms & Conditions at legal/#job-seeker-terms-of-... 

Allied Universal®

Security Professional - Manufacturing Patrol Job at Allied Universal®

 ...Job Description Allied Universal, North Americas leading security and facility services company, offers rewarding careers that provide...  ...eligibility. As a Security Professional - Manufacturing Patrol in Oshkosh, WI , you will serve and safeguard clients in a... 

Engbrecht Agency Staffing

Work from Home Sales Career — Full Training & Support Job at Engbrecht Agency Staffing

 ...for driven individuals ready to start a career in remote life insurance sales. Why Join Us? Qualified inbound leads no cold...  ...Requirements USA residency required Desire to serve and help families Life/health insurance license (or willingness to earn one)...