Cynthia is in direct contact with the company and can answer any questions you may have.
We are a social enterprise combining two entities: Voilà, a Canadian non-profit organization, and ConnectED Labs, a tech startup. Together, we deliver high-quality, web-based, 3D immersive learning environments in the metaverse, creating dynamic and interactive experiences.
Our mission is to build accessible, interactive, and cost-effective education and professional development solutions. By leveraging innovative technology, we foster global connections and simulate real-world learning experiences. Our scalable solutions enhance learning for corporations worldwide, equipping organizations with tools to improve onboarding, growth, and upskilling.
We are seeking an AI/ML Engineer to develop and integrate Agentic AI-powered avatars into our educational platform. This role involves leveraging DeepSeek, Generative AI, NLP, Speech-to-Text (STT), and Text-to-Speech (TTS) technologies to create a seamless and engaging experience for students, teachers, and tutors.
The ideal candidate will be experienced in building low-latency AI systems, optimizing real-time AI interactions, and working with LLMs in production, with a focus on multi-step reasoning and autonomous decision-making workflows.
● Develop and optimize AI models for Agentic workflows, enabling the AI to reason, plan, and retrieve data dynamically.
● Integrate Speech-to-Text (STT) services such as Whisper API and Google STT, and Text-to-Speech (TTS) services such as ElevenLabs and Google TTS.
● Implement long-term AI memory using vector databases (Pinecone, Weaviate) for persistent conversation recall.
● Build multi-step reasoning pipelines, ensuring the AI can autonomously refine responses and ask follow-up questions.
● Integrate API calls for autonomous information retrieval (e.g., Wikipedia, Google Search API, or internal knowledge sources).
● Deploy and manage AI models in a scalable microservice architecture (GKE/Kubernetes).
● Optimize AI inference performance to achieve real-time interactions with sub-1-second latency.
● Develop APIs for interacting with the AI agent in real time, with WebSocket support for streaming responses.
● Collaborate with the Unity team to integrate AI-driven conversational avatars with real-time behavior.
● Research and experiment with DeepSeek to optimize LLM responses for dynamic tutoring.
● Bachelor’s or Master’s in Computer Science, AI, Data Science, or a related field.
● 3+ years of experience in AI/ML model development and deployment.
● Strong experience with NLP, LLMs, and Generative AI (DeepSeek, OpenAI APIs, Gemini, Llama 3, etc.).
● Proficiency in Python, TensorFlow, PyTorch, or JAX.
● Experience with FastAPI, Flask, or Node.js for AI microservices.
● Hands-on experience with STT and TTS technologies.
● Experience developing multi-step reasoning workflows for AI.
● Familiarity with vector databases (Pinecone, Weaviate, FAISS).
● Experience deploying AI models in GCP (Vertex AI is a plus).
● DeepSeek implementation experience in real-time applications.
● Experience fine-tuning LLMs with custom datasets.
● Understanding of Agentic AI architectures, such as chain-of-thought reasoning.
● Experience with Kubernetes, Docker, and scalable microservices.
● Familiarity with WebSockets for real-time AI interactions.