b'nerd GmbH b'nerd GmbH
de | en
Managed AI API Gateway · EU-hosted · GDPR-compliant

Private AI for your business — hosted in Europe

OpenAI-compatible endpoints, curated open-source models, and transparent credit-based billing. Pilot with Starter from 490€ / month and scale to Business and Enterprise — without sending data to US providers.

Pilot programs available — talk to us.

Hosting
Germany & EU
Compliance
GDPR-compliant
API
OpenAI-compatible

Use cases

Real AI, in the tools you already use

We don't just host models — we bring AI directly into your existing platforms, so your data stays where it lives.

Nextcloud AI

Document summarization, semantic search, and AI chat over your files — built into the Nextcloud you already operate.

GitLab AI

Code explanations, merge request summaries, and a private dev assistant — running inside your GitLab, not someone else's cloud.

Agentic coding & IDE assistants

A private endpoint for Claude Code, Cursor, Continue, and CLI agents. Your code stays in your environment.

Platform

Your private AI platform — fully managed

Built for regulated environments where data sovereignty and predictable operations matter more than benchmarks.

Privacy-first

GDPR-compliant by design. No data leaves your environment, and nothing is shared with public AI APIs.

EU & Germany hosting

Operated in European data centers. Choose Germany or another EU region depending on your compliance needs.

OpenAI-compatible API

Drop-in compatible endpoints. Point your existing tooling at our platform without rewriting your integrations.

Modern open-source models

Curated, production-grade open-source models. Updated and operated by us — no model sprawl on your side.

High-performance inference

Powered by modern AI infrastructure on H100-class hardware, tuned for real-world workloads — not benchmarks.

Transparent usage

Clear usage tracking and predictable cost structure. No surprise bills from token spikes.

Drop-in compatible

OpenAI-compatible — switch in 5 lines

Same SDKs, same endpoints. Point your base URL and API key at b'nerd and your existing tooling runs on private AI.

Example model name; we'll share the current set of available models on request.

Architecture

Built for control and transparency

A platform you can grow into — from a first pilot to production-grade AI features across your tools.

1
Shared infrastructure with clear isolation
Workloads run on shared platform infrastructure with strict tenant isolation and tier-based prioritization. No surprise model swaps, no foreign data in your inference.
2
Hosted in Europe
Operated in EU and German data centers, under European jurisdiction. Data residency is a deployment choice, not a footnote.
3
Open architecture
Open-source models, OpenAI-compatible API, and standard integrations — so you can move, swap, or self-host later. No vendor lock-in.
4
From Starter to Enterprise
Pilot with Starter, scale to Business for production load, move to Enterprise for governance, compliance, and custom SLAs.

Managed AI API Gateway

Pricing

Three packages for any workload. Token prices and the credit system apply uniformly across all tiers.

All prices excl. VAT. B2B only.

Which package fits you?

Starter

For smaller production workloads, internal assistants, RAG prototypes, and controlled API usage.

Business

For team and enterprise workloads with higher throughput, more stable usage, and prioritized processing.

Enterprise

For business-critical AI workloads with governance, compliance, integration, and custom operational requirements.

Starter

Prototypes & smaller production workloads.

490€ / month

Start a pilot

What's included

  • 20M credits / month
  • Shared best-effort AI infrastructure
  • OpenAI-compatible API
  • Standard queue priority
  • Standard API limits & context
  • Baseline monitoring
  • GDPR-compliant hosting in the EU
  • Availability: up to 99.5%
  • Email support
MOST POPULAR

Business

Prioritized processing for team workloads.

1490€ / month

Request a demo

What's included

  • 50M credits / month
  • Prioritized processing within shared infrastructure
  • Extended API limits
  • Higher requests / tokens per minute
  • Extended context limits
  • Full access to Premium models
  • Optional VPN access · SSO available
  • Monitoring & extended usage reporting
  • Availability: up to 99.9%
  • Email support · Slack Connect / Teams optional

Enterprise

Business-critical · Compliance · Custom.

from

2490€ / month

Talk to sales

What's included

  • Custom credit quotas
  • Highest priority within shared infrastructure
  • Custom API limits & concurrency
  • Extended context limits
  • Full access to Premium models
  • Private networking available
  • VPN / SSO integration
  • Audit logging & extended reporting
  • Custom model integrations optional
  • Custom SLA agreements
  • Priority support (email, phone, Slack Connect / Teams)

Token pricing

Prices per 1M tokens. Applies uniformly across all packages.

Standard
Chatbots · RAG · automations
Included usage 1.90€
On-demand 2.90€
Advanced
Coding assistants · agents · complex assistants
Included usage 4.90€
On-demand 6.90€
Premium
Reasoning · high-end AI · complex analysis
Included usage 9.90€
On-demand 14.90€

Credit system

The platform bills based on credits. Different model classes consume different amounts of credits per token.

  • Standard 1× credits
  • Advanced 3× credits
  • Premium 6× credits
Example
  • 10M Standard tokens = 10M credits
  • 2M Advanced tokens = 6M credits
  • 0.5M Premium tokens = 3M credits
Total usage: 19M credits

Credit calculator

Estimate your usage and see what each package would cost — including on-demand overage.

Estimated usage

Chatbots, RAG, automations

0

Coding, agents, assistants

0

Reasoning, high-end

0

Enter millions of tokens per month. Credit factors: Standard 1×, Advanced 3×, Premium 6×.

Packages compared

Total usage

0

Starter

Business

Enterprise

Request this package

Non-binding estimate. Usage beyond the included quota is billed at on-demand rates (allocated proportionally across model classes).

Term & prepayment

Longer terms or prepayment increase your credit quota — list prices stay the same.

Monthly

Standard
  • Standard pricing
  • Flexible usage
  • No minimum term

12-month commitment

+20% credits
  • +20% additional credits per month
  • Stable price baseline for 12 months

12-month prepayment

+30% credits
  • +30% additional credits per month
  • Prepaid — one invoice per year

Fair usage & performance

An API-first managed service: you focus on integration, we run the infrastructure and models. Tier-based limits keep performance stable under load.

Requests per minute

Tier-based RPM limits protect the platform and ensure predictable response times.

Tokens per minute

TPM limits scale with your package — Business and Enterprise have significantly higher throughput.

Context limits

Maximum context size per request, depending on package and model.

Queue prioritization

Prioritized lanes for Business and Enterprise keep response times stable under load.

Private AI — FAQs

Common questions about data residency, integrations, and the engagement model.

Do you have questions or would you like a personalized offer? We are happy to advise you.

Contact

Our cloud experts are happy to provide personalized advice.

Our Office

Sillemstraße 76A

20257 Hamburg, Deutschland

Mon - Fri: 09:00 AM - 06:00 PM

Telefon
+49 40 239 69 754 0
Email
hello@bnerd.com
Talk to us