IONOS AI Model Hub

Your gateway to a sovereign multimodal AI platform

  • 100% GDPR-compliant and securely hosted in Europe
  • One platform for the most powerful AI models
  • No vendor lock-in with open source

Unleash the full power of AI safely and efficiently

The sovereign AI platform to access leading open source generative models for generating texts and images.

Sovereign and secure

Fully GDPR-compliant

  • Hosted exclusively in IONOS data centres across Europe
  • Your data remains yours and is never used for training
  • No third-party access ensures strict data sovereignty

Wide variety of use cases

Add AI to your applications

  • Analyse images, generate, summarise and translate text, or power conversational Q&A
  • Power semantic search, build recommendation engines, and cluster data
  • Generate visuals, mockups, and creative assets from text

Smarter answers

Vector search & RAG

  • Built-in vector database for similarity-based searches
  • Retrieval-augmented generation (RAG) tailors responses to your data
  • Supports multilingual embedding models for broader coverage

Serverless and scalable

Adapts to your needs

  • Pay only for what you use — per token
  • Runs smoothly from early tests to large-scale production
  • No need to manage infrastructure or operations

Leading models

Options for every use-case

  • Up to date with the latest and most reliable AI models
  • Best-of-breed international or European sovereign options
  • Access to open-source models with no licence fees

Seamless integration

Familiar interfaces

  • OpenAI-compatible REST API for a quick start
  • Works with standard frameworks and coding languages
  • Playground in the DCD for model testing

IONOS stands for secure and sustainable cloud solutions from Europe

Bring AI to life in few simple steps

  1. Query:
    Your query is sent to our powerful vector database to identify relevant information.

  2. Retrieve document:
    The vector database accesses the data and retrieves the most relevant documents for your query.

  3. LLM prompt:
    The retrieved documents are passed to your chosen large language model (LLM) to create a tailored response.

  4. Response:
    The response generated by the LLM is returned and ready for immediate use.

Fair pricing based on usage

Benefit from a token-per-use pricing model.
Large language models
Model
Price per 1 million input tokens (Price from 1 October 2025)
Price per 1 million output tokens (Price from 1 October 2025)
Standard

Llama 3.1 8B Instruct
Teuken-7B Instruct
Mistral Nemo Instruct New

£0.14
£0.14
Plus
Code Llama 13b Instruct HF
£0.41
£0.41

Mistral Small 24B Instruct New

£0.085
£0.26
Premium

gpt-oss-120b New

£0.14
£0.55
Llama 3.3 70B Instruct
£0.55
£0.55
Llama 3.1 405B Instruct
£1.58
£1.58
Standard
Model

Llama 3.1 8B Instruct
Teuken-7B Instruct
Mistral Nemo Instruct New

Price per 1 million input tokens (Price from 1 October 2025)
£0.14
Price per 1 million output tokens (Price from 1 October 2025)
£0.14
Plus
Model
Code Llama 13b Instruct HF
Price per 1 million input tokens (Price from 1 October 2025)
£0.41
Price per 1 million output tokens (Price from 1 October 2025)
£0.41
Model

Mistral Small 24B Instruct New

Price per 1 million input tokens (Price from 1 October 2025)
£0.085
Price per 1 million output tokens (Price from 1 October 2025)
£0.26
Premium
Model

gpt-oss-120b New

Price per 1 million input tokens (Price from 1 October 2025)
£0.14
Price per 1 million output tokens (Price from 1 October 2025)
£0.55
Model
Llama 3.3 70B Instruct
Price per 1 million input tokens (Price from 1 October 2025)
£0.55
Price per 1 million output tokens (Price from 1 October 2025)
£0.55
Model
Llama 3.1 405B Instruct
Price per 1 million input tokens (Price from 1 October 2025)
£1.58
Price per 1 million output tokens (Price from 1 October 2025)
£1.58
A token typically represents a unit of text processed by an AI model during inference. It can be a word, character, or another unit, depending on the model and language.
Text-to-image
Price per image from 1 October 2025
Stable Diffusion XL, FLUX.1
£0.0256
Stable Diffusion XL, FLUX.1
Price per image from 1 October 2025
£0.0256
Embedding Models
Convert any text or document into vector embeddings to power a full suite of AI applications, from similarity search and recommendation engines to RAG systems, whether you use our seamlessly integrated vector databases or your own external solution.
Price per 1 million tokens from 1 October 2025
paraphrase-multilingual-mpnet-base-v2
£0.01
bge-large-en-v1.5
£0.015
bge-m3
£0.02
paraphrase-multilingual-mpnet-base-v2
Price per 1 million tokens from 1 October 2025
£0.01
bge-large-en-v1.5
Price per 1 million tokens from 1 October 2025
£0.015
bge-m3
Price per 1 million tokens from 1 October 2025
£0.02
Data collections storage
Price per 1 million tokens
Storage

£0.009 / 30 days

Storage
Price per 1 million tokens

£0.009 / 30 days

Terms and conditions - Free of charge until 30 September 2025:

  • This campaign is valid from 1 December 2024 to 30 September 2025, and is available to both new and existing customers of the IONOS AI Model Hub during the campaign period.

  • Customers will receive a 100% discount on the usage of AI Model Hub, with the exception of ChromaDB storage which is excluded from this promotion and will be charged at the normal rate.

  • IONOS reserves the right to impose reasonable limits on usage to maintain the performance and integrity of the platform, and it may adjust these limits or restrict the inflow of new customers at its discretion to ensure optimal service quality. After the campaign period ends on 30 September 2025, all pricing will revert to the standard rates as published on the IONOS website.

  • This offer is non-transferable and cannot be combined with any other promotions or discounts. IONOS also reserves the right to modify or cancel this promotion at any time without prior notice. By participating in this campaign, customers agree to comply with all IONOS policies and guidelines.

The best AI for your applications in the IONOS Cloud

You're always a step ahead with the IONOS AI Model Hub.

Use case

Enrich text with AI-generated images

Generate engaging visuals from text prompts for better communication and impact.

  • Derive book covers based on a book’s abstract
  • Create eye-catching visuals to boost ad appeal
  • Increase visibility and engagement on social media

Use case

Create knowledge databases for patents

Build an application that provides patents or other confidential information within your company.

  • Import and store patents as plain text in a vector database
  • Find relevant material with a search string
  • Keep track of your data, always

Use case

Take your chatbot to the next level with AI

Use your company's internal knowledge base for chatbot-supported customer inquiries.

  • Export articles as text and import them into a vector database
  • Enhance customer engagement with quick, personalised responses
  • Let the IONOS AI Model Hub's LLMs handle valuable conversations

Documentation and guides

Getting started

Get a complete overview and maximise your data with the IONOS AI Model Hub's first steps for successful integration and use.

API interface

Integrate the desired LLMs into your applications with the IONOS AI Model Hub API and optimise them. Find out everything you need to know here.

Enrich your apps

  • Register for the IONOS Public Cloud
  • Get full access to the IONOS AI Model Hub API
  • Connect powerful AI models to your application

The IONOS Cloud in practice

Powerful and future-proof cloud infrastructure that many customers trust.

A variety of AI models with endless possibilities

Discover the advantages of our open-source-based AI solutions.

Generate high-quality content in seconds with an open-source language model from our AI Model Hub.

  • Texts written like a native
  • Creative writing and storytelling
  • Code generation

Generate high-quality content in seconds with an open-source language model from our AI Model Hub.

  • Texts written like a native
  • Creative writing and storytelling
  • Code generation

Reduce lengthy articles, paragraphs, or documents to their essential points with LLMs.

  • Summaries
  • Abstracts
  • Briefs

Reduce lengthy articles, paragraphs, or documents to their essential points with LLMs.

  • Summaries
  • Abstracts
  • Briefs

Use LLMs to select similar texts based on content or context and categorise them effectively.

  • Sentiment analysis
  • Topic labelling
  • Tagging content

Use LLMs to select similar texts based on content or context and categorise them effectively.

  • Sentiment analysis
  • Topic labelling
  • Tagging content

With the AI Model Hub, you can choose relevant texts based on predefined criteria and fine-tune the accuracy of search results.

  • Implement a relevance ranking
  • Enhanced query understanding with LLMs

With the AI Model Hub, you can choose relevant texts based on predefined criteria and fine-tune the accuracy of search results.

  • Implement a relevance ranking
  • Enhanced query understanding with LLMs

Store and process text elements more efficiently than with SQL or NoSQL databases using a vector database.

  • Identify relevant content for queries and searches
  • Provide content to an LLM for processing

Store and process text elements more efficiently than with SQL or NoSQL databases using a vector database.

  • Identify relevant content for queries and searches
  • Provide content to an LLM for processing

Benefit from precise and customised image generation with AI, creating unique visuals in no time.

  • Convert existing texts into visual representations
  • Create images based on text inputs

Benefit from precise and customised image generation with AI, creating unique visuals in no time.

  • Convert existing texts into visual representations
  • Create images based on text inputs
Questions about the IONOS Cloud?
Sales advice
0333 336 2984

Our product experts are here from 9:00–17:00, Monday to Friday.