← Back to Jobs

(Remote) Senior AI / Knowledge Graph Engineer (m/f/d)

Apply to this role
Pinnipedia Technologies GmbHBerlinFull-time
Posted on January 21, 1970Not specified

<p>Pinnipedia is a new Berlin startup building a cloud platform that automates and assists the creation of <strong>audit-ready IT-security concepts</strong> (e.g., BSI-Grundschutz, C5). We’re IGP-funded (2025/26) and co-develop with FU Berlin and pilot users from industry and security consulting.</p>

<p>We’re hiring an <strong>AI Engineer</strong> to turn messy inputs into structured knowledge and reliable answers.</p>

<p><strong>Your Mission</strong> -Own the end-to-end pipeline that turns unstructured documents into a validated, queryable knowledge graph. Accountable for extraction quality, graph integrity, and the data layer that backs the product's read path.</p>

<h2>Tasks</h2>

<p>• <strong>LLM extraction pipelines</strong> -document chunking, property and relationship extraction, cross-chunk reconciliation, gap detection. Built with structured-output LLM agents orchestrated by durable workflows.</p>

<p>• <strong>Knowledge graph</strong> -schema design as typed Pydantic models, Cypher access patterns and indexing strategy, graph operations, schema evolution and migration. Scope ends at the graph boundary: API contracts and query abstractions exposed to consumers belong to the full-stack engineer.</p>

<p>• <strong>Deterministic rule engines</strong> -table-driven evaluators for cases where code beats LLM judgment; clear contracts between deterministic and probabilistic components.</p>

<p>• <strong>Data validation &#x26; quality</strong> -schema enforcement, required-property contracts, audit trails, eval harnesses (expert review, unsupervised checks, synthetic fixtures, LLM-as-judge).</p>

<p>• <strong>Live data ops</strong> -backfills, coordinated migrations across relational + graph stores, observability on extraction throughput and quality, incident response.</p>

<h2>Requirements</h2>

<p><strong>Must-have</strong></p>

<ul>

<li>5+ years shipping data/AI systems to production with real customers -has been on-call for live pipelines and knows what breaks at 2am.</li>

<li>Strong Python (typed, modern) and SQL. Comfortable with PostgreSQL under load.</li>

<li>Production experience with at least one graph database (Neo4j preferred; Neptune, ArangoDB, TigerGraph acceptable) -schema design, query tuning, not toy use.</li>

<li>Production LLM pipeline experience: structured output, agent orchestration, prompt and version management, evaluation frameworks. PydanticAI, LangChain, DSPy, or Instructor all welcome.</li>

<li>Durable workflow orchestration in production (DBOS, Temporal, Airflow, Prefect, Dagster).</li>

<li>Test-first discipline -integration tests against real datastores (Testcontainers or equivalent), not mock-heavy unit tests.</li>

<li>Fluent English skills.</li>

</ul>

<p><strong>Nice-to-have</strong></p>

<ul>

<li>Experience with regulated, compliance-driven, or standards-heavy extraction domains (legal, medical, financial, security/audit).</li>

<li>Designed deterministic evaluators alongside LLM components and knows when to reach for which.</li>

<li>Contributions to data contracts, schema governance, or ontology work.</li>

<li>German language skills.</li>

</ul>

<h2>Benefits</h2>

<p><strong>Remote, full-time</strong> with flexible scheduling. <strong>CET (Berlin) timezone availability expected.</strong></p>

<p>Possibility of relocation if successfull work relationship is achieved after a period of time.</p>

<p><strong>Competitive salary: 32.000–42.000 €</strong> base (premium for exceptional senior profiles).</p>

<p>Small, focused team; direct collaboration with the Product Owner and Full-Stack Engineer.</p>

<p>Modern tooling, real ownership, and a learning budget for role-relevant training.</p>

<p>Impact: help SMEs meet rising security requirements with less friction.</p>

<p><strong>Apply on JOIN</strong> with your CV (PDF) and a short note (max <strong>200 words</strong>) describing <strong>how you would design a KG-backed RAG pipeline</strong> (ontology scope, indexing, retrieval, and evaluation you’d use).<br>

<strong>Process:</strong> 20-min intro → 90-min practical (graph modeling + retrieval evaluation) → 45-min team chat → references. We review applications within <strong>5 business days</strong>.</p>

<p>Find <a href="https://www.arbeitnow.com/">Jobs in Germany</a> on Arbeitnow</a>