      AI Inference Platform Engineer (m/f/d)

      • FAU Erlangen-Nürnberg
      • Erlangen
      • Full-time
      • IT & Digitalization
      Job description in brief
      Design and operate an AI inference platform with RAG capabilities, manage multi-tenant access and resources, and support university pilots from data preparation to production. Requires strong infrastructure skills, AI tooling experience, and English and German communication skills.

      Text was generated with AI

      About the company
      • Headquarters: Erlangen, Germany
      • More than 40,000 students
      • Founded in 1743
      • Focus on research and teaching
      • Wide range of degree programs and departments

      Text was generated with AI

      Your Tasks

      Your Role and Responsibilities:

      • Designing, implementing and maintaining an AI inference platform built predominantly on open-source components, including a web-based user interface and API, in a friendly and open work environment within a highly motivated, international team
      • Conceptualizing and implementing infrastructure components to create a RAG-capable inference environment
      • Advising and supporting pilot project partners at select universities that use this AI service infrastructure on data quality, data preparation and workflow design, helping to move prototypes into production and drawing on your friendly personality and communication skills
      • Designing and implementing tenant separation concepts for access, data and compute, integrating with federated single sign-on (SSO) institutional identity management systems
      • Implementing resource management mechanisms to ensure fair and efficient resource allocation and to allow for usage accounting and cost attribution

      Your Profile

      Required/Minimum Qualifications

      PhD or Master’s degree in computer science, data science, or another area of scientific computing

      Other Requirements

      • Proficiency in working in data center environments (incl. Linux, CLI, Git, GitLab)
      • Extensive knowledge of and experience in developing and maintaining platform environments for AI inference workflows, i.e. using, for example:
        • web servers / load balancers (e.g. Nginx) and databases (e.g. MariaDB, SQLite, Redis)
        • containers/OS-level virtualization (e.g. Docker) and container orchestration (e.g. Kubernetes), as well as HPC schedulers (e.g. Slurm)
        • monitoring tools for metric collection (e.g. Prometheus) and visualization (e.g. Grafana)
        • Python and JavaScript programming languages for development of frontend components (e.g. Open WebUI)
        • model gateways (e.g. LiteLLM) and inference engines (e.g. vLLM, Triton, SGLang), as well as underlying GPU-based technologies (e.g. torch, ray)
      • Knowledge of various types of AI models (e.g. LLMs, vision-language models, …), model guardrails and retrieval-augmented generation (RAG)
      • Willingness to keep up with current developments and to learn new technologies in the field of AI
      • Knowledge and experience in software deployment and software lifecycle management (ideally based on principles of continuous integration/continuous deployment, CI/CD)
      • Basic knowledge and practical skills in software design and engineering
      • Basic knowledge and practical skills in IT and cyber security for software and software platform development
      • English and German presentation and writing skills

      Benefits: We Have a Lot To Offer

      • Regular promotion to the next level and increase in salary pursuant to the collective bargaining agreement for the public service of the German Länder (TV-L) or remuneration pursuant to the Bavarian Public Servants Remuneration Act (BayBesG) plus an additional annual bonus
      • 30 days annual leave at five working days per week with additional free days on December 24 and 31
      • Occupational pension scheme and asset accumulation savings scheme
      • Excellent support during the academic qualification phase
      • Thorough onboarding process with a dedicated team
      • Subsidized food and drinks in our student restaurants
      • Place of work within comfortable walking distance of public transport
      • Family-friendly environment with childcare options, also during school holidays
      • Flexible working hours
      • A wide range of training courses and opportunities for professional development
      • Active health management

      Payment

      TV-L E 13

      Published on 19 May 2026
