Job Context:
Riseup Labs is looking for an LLM Engineer with expertise in NLP, STT, and TTS systems to design and optimize real-time conversational AI applications. You will work on building chatbots, voice assistants,
RAG systems, and generative AI apps
Job Responsibilities:
- Fine-tune and deploy LLMs (OpenAI, Anthropic, HuggingFace, Llama, Mistral, etc.) for real-world use cases.
- Build conversational AI systems with speech-to-text (Whisper, AWS
- Transcribe, Google STT) and text-to-speech (Azure TTS, ElevenLabs, Google TTS).
- Implement low-latency, real-time pipelines for voice and chatbot interactions.
- Design and optimize RAG (retrieval-augmented generation) with vector databases (Pinecone, FAISS, Weaviate, Milvus).
- Develop custom NLP modules (intent detection, summarization, translation, semantic search).
- Apply latency reduction techniques such as model quantization, caching, batching, and streaming inference.
- Collaborate with solution architects and backend engineers to integrate AI into web, mobile, and IoT applications.
- Monitor, test, and optimize model performance, cost, and response time.
- Stay updated on LLM, multimodal AI, and real-time voice AI research.
Educational Requirements:
- Bachelor’s/Master’s in Computer Science, AI/ML, or related field.
Additional Requirements:
- 3+ years of experience in NLP/ML, with at least 2+ year in LLM-based solutions.
- Strong programming skills in Python with PyTorch / TensorFlow.
- Hands-on experience with STT (Whisper, Deepgram, Vosk) and TTS (ElevenLabs, Coqui, Azure Speech, Google TTS).
- Experience in latency reduction (quantization, GPU optimization, distributed inference).
- Familiarity with LangChain, LlamaIndex, Rasa, or Botpress.
- Proficiency in cloud AI deployments (AWS Sagemaker, GCP Vertex AI, Azure ML).
- Knowledge of microservices, Docker, Kubernetes, and real-time streaming systems (Kafka, WebRTC, gRPC).
Nice to have:
- Experience with multimodal AI (speech + text + vision).
- Contributions to open-source STT/TTS or NLP projects.
- Background in edge AI for real-time, low-latency applications.
- Research experience in ASR (automatic speech recognition) or speech synthesis.
Workplace:
Salary:
- Negotiable(Based on experience and skills)
Compensation & Other Benefits:
- Annual Performance Evaluation and Increment
- Festival Bonus (2)
- Group Life and Health Insurance
- Full Subsidize Lunch
- Annual Retreats
- Wedding Bonus (As per company’s policy)
- Celebration of Events & Occasions
- Team Outing
- Training & Development by Organization Assigned Consultants
- Weekly 2 holidays (Sat & Sun)
- Paid Time Off 24 days (CL & SL)
- Maternity Leave with benefits (As per company's policy)
- Paternity Leave
- Public holidays as per Riseup Labs calendar
The Application Process:
- Telephone Round.
- Interview with the Tech Lead & Talent Acquisition Team.
- Final Interview with the HR Lead.
- Job Offer.