LLM Engineer

Remote

Full Time

Experienced

Company Description

Vyro is at the forefront of innovation, transforming content creation through advanced AI and Machine Learning technologies. As a rapidly growing Gen-AI and SaaS-focused company, we empower creativity across industries with state-of-the-art tools. Our flagship products include ImagineArt, an AI-powered design studio that turns text into stunning visuals, and Chatly, an intelligent multimodal assistant that uses frontier AI models for seamless task management and idea generation.

With 15+ products, over 2.5 billion images processed, and 800,000+ daily active users, Vyro is actively shaping the future of creative tools. Join our passionate team of Vyronauts to make an impact and innovate with us!

Role Description

This is a full-time, on-site role for an LLM Engineer based in Islamabad. The role involves designing, developing, and fine-tuning LLMs, building agentic AI workloads, implementing data-driven algorithms, and deploying scalable solutions. You will collaborate closely with cross-functional teams to integrate cutting-edge machine learning capabilities into Vyro’s products, while exploring new methods to enhance performance, reliability, and efficiency.

Qualifications

Experience & Education

4+ years of industry experience in Machine Learning or NLP
Bachelor’s degree in Computer Science (BSCS) or a related field

Frontier Model Orchestration

Deep experience leveraging closed-source SOTA models from OpenAI, Anthropic, Google, and xAI
Strong understanding of complex reasoning, tool-use, and multi-step AI pipelines

Advanced Architectures

Expert grasp of transformer variants and Mixture-of-Experts (MoE) architectures
Proven hands-on experience with open-weight SOTA models such as Llama 3.x, Mistral Large, Qwen 2.5, and Phi-4

Agentic Frameworks

Mastery of multi-agent orchestration using frameworks like LangGraph (stateful agents), AutoGen, or CrewAI
Experience implementing DSPy for declarative, self-optimizing prompt pipelines

Production RAG & Memory Systems

Implementation experience with GraphRAG and hybrid retrieval strategies
Expertise with vector stores (Qdrant, Milvus, Weaviate) and semantic caching for long-term agent memory

Inference Optimization

Experience deploying high-throughput models using vLLM, TensorRT-LLM, or SGLang
Familiarity with FlashAttention-2, KV caching, and quantization techniques (AWQ, EXL2)

Why Join Us?

Work on innovative AI products like Chatly and ImagineArt that are shaping the future of user interaction and creativity
Collaborate with a passionate, talented team that values experimentation, innovation, and data-driven decision-making
Competitive salary and benefits package
A growth-driven culture that encourages learning, ownership, and continuous improvement

Note: This is an on-site position at our office in H12, Islamabad, for residents of Pakistan. Candidates residing outside Pakistan may be considered for remote work.

Apply for this position

Required*

First Name*

Last Name*

Email Address*

Phone*

Resume*

Are you willing to relocate?*

LinkedIn Profile URL:*

Desired salary*

Which open-source LLMs have you worked with directly?

Which closed-source frontier models (OpenAI, Anthropic, Google, xAI) have you integrated into production systems?*

Which open-weight SOTA models (Llama 3.x, Mistral Large, Qwen 2.5, Phi-4) have you deployed?

GitHub profile link

Join Us!