We are looking for an AI Engineer with hands-on experience in Large Language Models (LLMs) to integrate intelligent features into our software product. This role focuses on Java backend development and requires expertise in both cloud and offline/on-premises AI solutions.

Key Responsibilities:

Integrate LLMs (e.g., GPT-4, Falcon, LLaMA, Mixtral) into Java backend systems.
Develop local services (in Python or Java) to serve offline models when needed.
Design and maintain REST/JSON endpoints for communication between Java services and AI modules.
Personalize and adapt model outputs through prompt engineering.
Implement logic for natural language understanding, question/answer generation, and response analysis.
Support hybrid architecture: cloud-first with fallback or dedicated on-premises mode.
Ensure data privacy, performance, and security in AI integrations.
Collaborate with backend, frontend (Angular), and product teams for seamless integration.

Qualifications

Required Skills & Experience:

Experience with LLMs (e.g., GPT, Falcon, LLaMA, BloomZ).
Experience integrating APIs (OpenAI, HuggingFace, Ollama).
Strong Python and Java skills for backend development (FastAPI, Flask).
Expertise in Java backend development, especially with Spring Boot.
Familiarity with AWS services (API Gateway, EC2, Lambda, etc.).
Experience deploying AI models in on-premises environments.
Familiar with model quantization and serving tools (HuggingFace, llama.cpp, Ollama).

Nice to Have:

Familiarity with LangChain, vLLM, or Retrieval-Augmented Generation (RAG).
Experience with multilingual prompt engineering.
Working knowledge of Angular.
Experience with AI solutions in offline enterprise environments.
Knowledge of privacy regulations (e.g., GDPR) and edge computing best practices.

Who You Are:

Solution-oriented, with strong problem-solving skills.
Comfortable working autonomously and taking technical ownership.
Eager to collaborate with cross-functional teams.
Curious and passionate about exploring new AI technologies.

Additional Information

What We Offer:

An innovative product focused on real-world Generative AI.
Influence in technical decisions and solution architecture.
Flexible, remote work with autonomy.
Growth opportunities with modern tools and open-source models.

If you’re excited about making an impact in the AI space, we’d love to hear from you! Apply now and join our dynamic team.

About Penguin Formula

👥51-200

📍Lisbon

🔗Website

Penguin Formula Service

How does Penguin Formula work?

Our teams are experts in the development, customization, and integration of enterprise web solutions. Our core skills are formed of Java and JavaScript but our passion for IT and our ability to combine a wide variety of emerging languages and open-source tools enable us to expand our know-how to a wide range of technologies.

Company culture

Visit company profile

Unlock all Arc benefits!

Browse remote jobs in one place
Land interviews more quickly
Get hands-on recruiter support

PRODUCTS

Arc

The remote career platform for talent

Codementor

Find a mentor to help you in real time

LINKS

About us Pricing Arc Careers - Hiring Now!Remote Junior Jobs Remote jobs Career Success Stories Talent Career Blog Arc Newsletter

JOBS BY EXPERTISE

Remote Front End Developer Jobs Remote Back End Developer Jobs Remote Full Stack Developer Jobs Remote Mobile Developer Jobs Remote Data Scientist Jobs Remote Game Developer Jobs Remote Data Engineer Jobs Remote Programming Jobs Remote Design Jobs Remote Marketing Jobs Remote Product Manager Jobs Remote Project Manager Jobs Remote Administrative Support Jobs

JOBS BY TECH STACKS

Remote AWS Developer Jobs Remote Java Developer Jobs Remote Javascript Developer Jobs Remote Python Developer Jobs Remote React Developer Jobs Remote Shopify Developer Jobs Remote SQL Developer Jobs Remote Unity Developer Jobs Remote Wordpress Developer Jobs Remote Web Development Jobs Remote Motion Graphic Jobs Remote SEO Jobs Remote AI Jobs

Cookie Policy Privacy Policy Terms of Service