AI & Backend Engineer – Python, FastAPI & Generative AI

AI
Lahore
Full-time
PKR 50,000 - 120,000 / month
Posted: December 17, 2025 | Apply by: February 15, 2026

About the Role

Cognilium AI is a production-first AI product engineering company building reliable, scalable AI systems for startups and enterprises. Founded in 2019, we specialize in agentic AI, enterprise-grade RAG/NL2SQL, voice AI, and cloud-native data platforms, all engineered for real users, real scale, and measurable ROI. We don’t build demos. We ship AI that runs in production.

We’re hiring an AI & Backend Engineer to design, build, and operate LLM-powered backend systems using Python and FastAPI. This is a hands-on, ownership-driven role focused on building real-world generative AI systems, from agentic workflows and RAG pipelines to scalable APIs deployed in live environments.

You’ll work across the full lifecycle: architecture, implementation, deployment, observability, and optimization. Performance, cost control, and reliability are not afterthoughts; they’re core requirements. If you want to move fast, own systems end-to-end, and build AI that actually works in production, this role is for you.

Responsibilities

  • Design, build, and deploy end-to-end generative AI applications
  • Implement LLM-powered workflows including agentic and multi-step reasoning systems
  • Develop production-grade RAG pipelines with grounding, citations, and guardrails
  • Build high-performance asynchronous APIs using Python and FastAPI
  • Design and maintain scalable microservices exposing AI capabilities
  • Implement authentication, rate limiting, background jobs, and service boundaries
  • Integrate LLMs from OpenAI, Anthropic, and open-source providers
  • Design prompts optimized for accuracy, latency, and cost control
  • Work with vector databases for retrieval and semantic search
  • Design systems for scalability, fault tolerance, and security
  • Implement observability, structured logging, and monitoring across AI services
  • Deploy AI services to cloud environments with an AWS-first approach
  • Containerize applications using Docker and support CI/CD pipelines
  • Apply MLOps best practices including evaluation, monitoring, rollback, and cost governance
  • Collaborate with product, frontend, and data teams to deliver business outcomes
  • Participate in architecture reviews and technical decision-making

Requirements

  • Strong backend engineering experience with Python
  • Hands-on experience building APIs using FastAPI
  • Practical experience working with Large Language Models (LLMs)
  • Experience building generative AI or RAG-based systems
  • Solid understanding of RESTful API design and service architecture
  • 0–1 year of professional experience or equivalent hands-on project experience
  • Ability to take ownership of systems from development to production

Nice to Have

  • Experience deploying systems on AWS, GCP, or Azure
  • Familiarity with LangChain, LlamaIndex, LangGraph, or CrewAI
  • Experience with Docker, Kubernetes, and CI/CD pipelines
  • Familiarity with vector databases such as Pinecone, Weaviate, Qdrant, Chroma, or OpenSearch
  • Experience with SQL and/or NoSQL databases
  • Exposure to voice AI or real-time systems

Benefits & Perks

  • Competitive market-aligned salary
  • Hands-on exposure to production-grade AI systems, not prototypes
  • High ownership and accelerated learning curve
  • Builder-led, engineering-first culture
  • Opportunity to work on cutting-edge AI systems with real business impact
  • Clear growth path in backend engineering and applied AI
