About the Role

Cognilium AI is a production-first AI product engineering company building reliable, scalable AI systems for startups and enterprises. Founded in 2019, we specialize in agentic AI, enterprise-grade RAG/NL2SQL, voice AI, and cloud-native data platforms, engineered for real users, real scale, and measurable ROI. We don’t build demos. We ship AI that runs in production.

We’re hiring an AI & Backend Engineer to design, build, and operate LLM-powered backend systems using Python and FastAPI. This is a hands-on, ownership-driven role focused on building real-world generative AI systems, from agentic workflows and RAG pipelines to scalable APIs deployed in live environments.

You’ll work across the full lifecycle: architecture, implementation, deployment, observability, and optimization. Performance, cost control, and reliability are not afterthoughts; they’re core requirements. If you want to move fast, own systems end-to-end, and build AI that actually works in production, this role is for you.

Responsibilities

Design, build, and deploy end-to-end generative AI applications

Implement LLM-powered workflows including agentic and multi-step reasoning systems

Develop production-grade RAG pipelines with grounding, citations, and guardrails

Build high-performance asynchronous APIs using Python and FastAPI

Design and maintain scalable microservices exposing AI capabilities

Implement authentication, rate limiting, background jobs, and service boundaries

Integrate LLMs from OpenAI, Anthropic, and open-source providers

Design prompts optimized for accuracy, latency, and cost control

Work with vector databases for retrieval and semantic search

Design systems for scalability, fault tolerance, and security

Implement observability, structured logging, and monitoring across AI services

Deploy AI services to cloud environments with an AWS-first approach

Containerize applications using Docker and support CI/CD pipelines

Apply MLOps best practices including evaluation, monitoring, rollback, and cost governance

Collaborate with product, frontend, and data teams to deliver business outcomes

Participate in architecture reviews and technical decision-making

Requirements

Strong backend engineering experience with Python

Hands-on experience building APIs using FastAPI

Practical experience working with Large Language Models (LLMs)

Experience building generative AI or RAG-based systems

Solid understanding of RESTful API design and service architecture

0–1 year of professional experience or equivalent hands-on project experience

Ability to take ownership of systems from development to production

Nice to Have

Experience deploying systems on AWS, GCP, or Azure

Familiarity with LangChain, LlamaIndex, LangGraph, or CrewAI

Experience with Docker, Kubernetes, and CI/CD pipelines

Familiarity with vector databases such as Pinecone, Weaviate, Qdrant, Chroma, or OpenSearch

Experience with SQL and/or NoSQL databases

Exposure to voice AI or real-time systems

Benefits & Perks

Competitive market-aligned salary

Hands-on exposure to production-grade AI systems, not prototypes

High ownership and accelerated learning curve

Builder-led, engineering-first culture

Opportunity to work on cutting-edge AI systems with real business impact

Clear growth path in backend engineering and applied AI

AI & Backend Engineer – Python, FastAPI & Generative AI
