LLM Engineer

Job Type: Full time No. of Vacancies: 1 Experience: 3+ Years
Apply Now
Riseup Labs

 

Linkedin Button

Riseup Labs Google Review

Job Context:

Riseup Labs is looking for an LLM Engineer with expertise in NLP, STT, and TTS systems to design and optimize real-time conversational AI applications. You will work on building chatbots, voice assistants,
RAG systems, and generative AI apps

Job Responsibilities:

  • Fine-tune and deploy LLMs (OpenAI, Anthropic, HuggingFace, Llama, Mistral, etc.) for real-world use cases.
  • Build conversational AI systems with speech-to-text (Whisper, AWS
  • Transcribe, Google STT) and text-to-speech (Azure TTS, ElevenLabs, Google TTS).
  • Implement low-latency, real-time pipelines for voice and chatbot interactions.
  • Design and optimize RAG (retrieval-augmented generation) with vector databases (Pinecone, FAISS, Weaviate, Milvus).
  • Develop custom NLP modules (intent detection, summarization, translation, semantic search).
  • Apply latency reduction techniques such as model quantization, caching, batching, and streaming inference.
  • Collaborate with solution architects and backend engineers to integrate AI into web, mobile, and IoT applications.
  • Monitor, test, and optimize model performance, cost, and response time.
  • Stay updated on LLM, multimodal AI, and real-time voice AI research.

Educational Requirements:

  • Bachelor’s/Master’s in Computer Science, AI/ML, or related field.

Additional Requirements:

  • 3+ years of experience in NLP/ML, with at least 2+ year in LLM-based solutions.
  • Strong programming skills in Python with PyTorch / TensorFlow.
  • Hands-on experience with STT (Whisper, Deepgram, Vosk) and TTS (ElevenLabs, Coqui, Azure Speech, Google TTS).
  • Experience in latency reduction (quantization, GPU optimization, distributed inference).
  • Familiarity with LangChain, LlamaIndex, Rasa, or Botpress.
  • Proficiency in cloud AI deployments (AWS Sagemaker, GCP Vertex AI, Azure ML).
  • Knowledge of microservices, Docker, Kubernetes, and real-time streaming systems (Kafka, WebRTC, gRPC).

Nice to have:

  • Experience with multimodal AI (speech + text + vision).
  • Contributions to open-source STT/TTS or NLP projects.
  • Background in edge AI for real-time, low-latency applications.
  • Research experience in ASR (automatic speech recognition) or speech synthesis.

Workplace: 

  • Uttara, Dhaka

Salary: 

  • Negotiable(Based on experience and skills)

Compensation & Other Benefits:

  • Annual Performance Evaluation and Increment
  • Festival Bonus (2)
  • Group Life and Health Insurance
  • Full Subsidize Lunch
  • Annual Retreats
  • Wedding Bonus (As per company’s policy)
  • Celebration of Events & Occasions
  • Team Outing
  • Training & Development by Organization Assigned Consultants
  • Weekly 2 holidays (Sat & Sun)
  • Paid Time Off 24 days (CL & SL)
  • Maternity Leave with benefits (As per company's policy)
  • Paternity Leave
  • Public holidays as per Riseup Labs calendar

The Application Process:

  • Telephone Round.
  • Interview with the Tech Lead & Talent Acquisition Team.
  • Final Interview with the HR Lead.
  • Job Offer.
Apply Now