4 slots Q2/26

AI CONSULTING · VIENNA · SINCE 2023

AI consulting
that calculates, not sells hype.

We advise DACH mid-market on AI: where it actually creates leverage, which tool stack fits the use case, how data protection and the EU AI Act are addressed cleanly — and when a project simply isn’t ready yet. Hands-on since the first GPT generation in 2023.

Trusted by
Mercedes-Benz
BMW Deutschland
MINI Deutschland
Red Bull
IKEA
Allianz
A1
Sky
Electronic Arts
OBI

What AI consulting in mid-market actually means.

AI consulting covers the strategic and technical guidance for productive use of large language models, machine learning, agent architectures and automation workflows in a company.

It typically covers the following phases:

  • Sober use-case assessment — which processes actually have leverage, which are marketing hype
  • Tool and model selection (US frontier models vs. EU-hosted open-source models)
  • Pilot implementation with a clearly defined success criterion
  • Integration into existing systems (ERP, CRM, shop, databases)
  • Governance & data-protection setup (GDPR, EU AI Act, logging discipline)
  • Team training for sustainable use

Unlike pure "AI workshops" or tool-licence brokerage, we own the full arc — from the first hypothesis to a productive, monitored application with a clear business case.

What to expect from serious AI consulting.

Four things that distinguish serious AI consulting from hype workshops and tool brokerage — and which we deliver in every engagement.

01

Sober use-case assessment instead of AI hype

We actively say when a use case is better solved with classic software, an SQL query or a slim script than with an LLM. Not every request needs GPT-5; some questions are "AI cases" only because the buzzword unlocks budget. You get an honest recommendation — even when it runs counter to our short-term revenue interest.

02

Privacy-first architecture instead of US-cloud default

For each use case we assess where a US frontier model (OpenAI, Claude) makes sense — and where an EU-hosted open-source model (Mistral, Llama, Qwen on Hetzner Frankfurt or Fly.io Frankfurt) is more economical and more privacy-friendly. For sensitive data we rely on on-premise or dedicated inference setups. Data flows, logging discipline and model choice belong to the architecture, not to a later compliance retrofit.

03

Hands-on since 2023 — we build, not just PowerPoint

We have used LLMs productively since the first GPT generation — in our own tools, client projects, and in our own shop operations. RAG pipelines, agent setups with tool use, structured output parsing, eval strategies against hallucinations, cost monitoring at token level: these are topics we’ve seen in real production workloads — not just demos. You get advice from people who afterwards write the code.

04

Model- and vendor-agnostic — no lock-in

We do not recommend a tool because we receive a commission, but because it fits the use case. Applications are built model-agnostic (abstracted via LiteLLM, OpenRouter, or own wrappers) — if a better or cheaper model arrives tomorrow, you switch without re-architecting. Licences, contracts and data belong to you, not to us.

AI consulting at clickpuls — what we concretely do.

Use-case discovery: where AI actually creates leverage in mid-market

The most common misjudgement in mid-market: treating AI as a universal tool that lifts efficiency everywhere.

High-value use cases usually sit in three places:

  • Recurring knowledge work with unstructured input — offer drafting, contract analysis, supplier correspondence, support triage
  • Data extraction from documents, emails, PDFs — invoice capture, delivery-note processing, master-data maintenance
  • Research and synthesis tasks with provable sources — RAG over internal documents, competitor monitoring, market briefings

What is rarely economical:

  • AI as a replacement for well-defined ETL pipelines
  • AI for tasks with high hallucination risk and hard compliance
  • AI as pure chatbot gimmickry without connection to real data

Our discovery format: two workshop half-days with the operational people — not with the steering committee. We walk through real daily routines (which emails arrive, which Excel sheets are opened daily, which reports are manually compiled) and assess each candidate along four dimensions: frequency, complexity, fault tolerance and data availability.

Output is a prioritised roadmap — typically 3–5 quick wins for the next 8 weeks plus 2–3 strategic cases for 6–12 months, each with rough effort and a business-case hypothesis.

LLM, RAG, agent or classical ML — the right architecture per case

Not every task is a prompt problem. We cleanly separate four architecture classes.

Four architecture classes:

  • Pure LLM tasks (rewriting, translating, summarising) — fast and cheap, little engineering
  • RAG (retrieval-augmented generation) — when the model should access internal documents, product data or knowledge base without hallucinating. Stack: vector database (pgvector, Qdrant), embedding model (OpenAI, BGE, Cohere), hybrid retrieval (BM25 + semantic), re-ranking, clear source attribution
  • Agent setups with tool use — when the model should perform multi-step actions (query data, write emails, trigger ERP postings). Here tool definition, eval loops and guardrails matter
  • Classical ML (forecasting, classification, anomaly detection) — when the task has structured inputs and reproducible outputs, a gradient-boosted tree is often better, faster and cheaper than an LLM

We choose the architecture by task, not by tool trend. Frequent market mistake: using agentic frameworks (LangChain, LangGraph, CrewAI) for tasks solvable with two API calls and an if-condition — the result is brittle systems with high token costs. We build as little complexity as necessary.

Tool and model stack: what we recommend when

Frontier models (US cloud) — OpenAI GPT-5, Anthropic Claude Opus 4.7, Google Gemini 2.5 — we recommend when the task needs maximum reasoning depth, the data flow is legally workable (DPA, EU region if available, consent clarity) and the token budget fits.

EU / open-source models — Mistral Large / Codestral (French hosting), Llama 3.3 / Llama 4 on own infrastructure, Qwen2.5 for multilingual workloads — we recommend when data residency is hard-required, token volume economically argues for self-hosting (typically from a few million tokens/month), or when model stability over years matters more than frontier reasoning.

Stack components by use case:

  • Embeddings and re-ranking: OpenAI text-embedding-3, BGE-M3, Cohere Rerank — depending on language and domain
  • Vector storage: pgvector in Postgres (sufficient for most use cases, no extra system), Qdrant (when real hybrid search and large corpora are needed), or Elasticsearch (if you operate it already)
  • Agent orchestration: native tool use of the models (OpenAI function calling, Anthropic tool use) as default, LangGraph only when complexity really justifies it

Hosting of our inference containers and vector DBs runs on Hetzner Frankfurt or Fly.io Frankfurt — data processing within the EU.

Data protection, EU AI Act and governance — implemented cleanly on the technical side

We cleanly implement the technical prerequisites for GDPR-compliant and EU-AI-Act-compatible AI operation — the legally binding assessment, however, explicitly belongs to your lawyer or DPO.

What we implement technically:

  • EU hosting of applications and vector DBs (Hetzner, Fly.io, Frankfurt)
  • Logging discipline — which inputs/outputs are stored for eval, which are not
  • Data classification before the model call — PII masking, secrets filtering
  • Data-processing agreements with all model providers
  • Clear separation between training and inference data — no training on customer data without explicit opt-in

On the EU AI Act: since February 2025 the bans on "unacceptable" systems are in force, from August 2025 the duties for general-purpose AI models, from August 2026 the main duties for high-risk systems.

In mid-market the most common practical question is: which of our AI applications fall into high-risk categories?

  • High-risk (typical): HR screening, credit scoring, biometric identification, education
  • Minimal risk (typical): internal productivity tools, marketing copy, translation

We deliver a pragmatic classification of your use cases plus the technical building blocks for the duties (risk management, logging, transparency notices, human oversight). The final compliance stamp is set by your DPO or lawyer — we deliver the technical template.

Integration into existing systems — AI is not an island tool

An AI application only creates leverage when connected to the real data flows.

Existing systems we integrate LLM-powered workflows with:

  • ERP / accounting: SAP Business One, Microsoft Dynamics, BMD, DATEV
  • CRM: Salesforce, HubSpot, Pipedrive
  • E-commerce: Shopify, WooCommerce
  • Productivity / collaboration: M365 (Outlook, Teams, SharePoint), Google Workspace, Atlassian (Jira, Confluence)

Connected via REST, GraphQL, webhooks or native connectors.

Typical architecture pattern: a slim integration service (Node.js on Fly.io, Python on Hetzner) as mediator between existing system and model API, with idempotency, retry logic and clean error tracking via Sentry.

For sensitive workflows: an approval step with a human in the loop before the model writes into a system — we don’t sell you the "agent does everything autonomously" fairy tale.

What sets us apart — practice from our own operations

Over the years we have built and operated numerous own shops and digital products — dropshipping models, our own warehouse with product development, our own brands with assortment build-up and marketing.

Since 2023 we have used AI productively in our own operations:

  • Product copy generation with brand-voice consistency
  • Supplier email triage
  • Customer-support pre-classification
  • Competitive briefings via RAG over public sources
  • Internal SOP search

This experience — what actually holds up in a running mid-market operation and what fails on the friction of reality — flows into every engagement.

Our team combines three disciplines:

  • Software and AI engineering
  • E-commerce strategy and platform know-how
  • Brand and marketing practice

You talk directly with the seniors who afterwards build the system, set up the eval pipeline and write the cost reports — not through three layers of account management that "sell" AI consulting and then hand it down to junior implementers.

And we actively say no if a use case isn’t mature (too little training data, too high hallucination risk, no clear success metric) — even when it costs us an engagement.

WHEN DO YOU NEED THIS?

When do you need AI consulting?

Four typical situations in which DACH mid-market and corporate departments move from internal tool exploration to structured AI consulting.

01 / TRIGGER

Strategy clarity before the next budget cycle

You’ve reserved an "AI budget" for 2026 — but have no robust roadmap on which use cases to prioritise, what the business case is, and which capabilities must be built internally. We deliver a sober assessment instead of AI hype slides.

02 / TRIGGER

First pilot projects never made it out of the sandbox

Your team has ChatGPT Plus licences, a few custom GPTs and Microsoft Copilot — but nothing has landed productively in the core business. Typical: data flows missing, eval discipline missing, hallucinations unmonitored, costs opaque. We move pilots into production-ready architecture.

03 / TRIGGER

Data protection or compliance blocks further roll-outs

Your legal department or DPO has hit the brakes on the first AI tools — and without a clear technical architecture nothing gets through. We deliver technical building blocks for GDPR-compliant operation, EU hosting setups, data-flow classification and preparation for EU AI Act duties — as a template for legal review by your lawyer or DPO.

04 / TRIGGER

Operationally scaling concrete use cases

You have 1–2 successful AI applications in one area but want to roll them out stably to other departments, languages or markets — including monitoring, cost controlling, model-switch strategy, and training of power users. This is where you leave the "nice demo" stage and need engineering discipline.

Sounds like your project?

30–45 minutes for a first call — free and non-binding. We assess your use-case, estimate effort and risks, and give an honest recommendation — even if it means this is better built elsewhere.

Build AI capability internally or consult with clickpuls?

An honest side-by-side of the two most common paths in mid-market — depending on volume and strategic depth, we also recommend the in-house path.

Kriterium / Criterion
In-house AI lead (1 senior + 1 engineer)
AI consulting · clickpuls
Monthly fully-loaded cost (DACH market)
€ 16–22k (salaries + tools + recruiting)
from € 1,700/month (consulting retainer)
Time-to-productive for first use case
6–12 months (recruiting + onboarding)
4–8 weeks from discovery workshop
Available skill depth
1–2 profiles, knowledge cluster risk
LLM engineering, RAG, agents, eval, integration, hosting
Tracking vendor and model updates
Has to happen on the side
Part of our daily work
Compliance and EU AI Act preparation
Requires extra external legal/DPO advisory
We provide technical templates; legal review through your lawyer/DPO
When in-house pays off
When strategic in-house AI work involves > 5 parallel workstreams
From pilot through scaling several use cases
OUR PROCESS

How AI consulting with clickpuls runs.

Four phases from discovery workshop to a productive use case with clear economics — transparent, plannable, no AI mystery.

01

Discovery & use-case mapping

2–4 weeks. Two workshop half-days with operational people, data-flow capture, use-case assessment along four dimensions (frequency, complexity, fault tolerance, data availability). Output: prioritised roadmap with 3–5 quick wins and 2–3 strategic cases, each with effort estimate and business-case hypothesis. Discovery is its own paid engagement — fixed price quoted upfront.

02

Pilot implementation of the first use case

4–8 weeks. Architecture decision (LLM, RAG, agent, classical ML), tool and model choice, pilot build with eval pipeline (evals against hallucinations, cost monitoring at token level), integration with at least one existing system, human-in-the-loop approval where needed. Fixed-price quote after discovery.

03

Scaling, monitoring and training

4–12 weeks. Roll-out to additional departments / languages / markets, monitoring dashboards (Sentry, Better Stack, own eval reports), cost controlling, power-user training, workflow documentation. On demand, accompanying retainer for ongoing development and model updates in multiple package sizes.

04

Governance, EU AI Act and roadmap update

Quarterly review: use-case classification against EU AI Act risk tiers, logging and data-flow audit, template for your DPO / lawyer for legally binding assessment, model-stack update (new frontier models, new open-source options), roadmap update for the next quarter. Written report included.

KEY FIGURES

Key figures from our AI consulting.

Realistic figures from active AI consulting engagements in DACH mid-market — no hype metrics, no marketing promises.

Discovery workshop
2 × half-day + roadmap

Fixed-price discovery with use-case mapping, prioritised roadmap and business-case hypothesis per case. Concrete quote after a free 30–45 minute first call.

Accompanying retainer (optional)
from € 1,700/month

Monthly hour quota in multiple package sizes for ongoing development, model updates, eval maintenance and scaling new use cases. Unused hours roll over one month.

Response times (standard)
within 24 h on business days

Standard response within 24 h on business days for every request. Faster SLAs (e.g. 4h response in CET business hours or 24/7 on-call) are available as a contractual add-on, not an automatic default.

Hosting / data residency
EU (Hetzner, Fly.io)

Inference containers, vector DBs and integration services run in Frankfurt — EU data processing as the technical prerequisite for GDPR-compliant operation.

Ready for a first call?

30–45 minutes by call, no commitment. Tell us briefly what you need — we get back within one business day with concrete next steps and a realistic effort estimate.

DACH CONTEXT

AI consulting for AT, DE and CH.

DACH mid-market starts from a different position with AI than US tech companies — and we know this reality because most of our clients sit in Vienna, Salzburg, Linz, Munich, Hamburg, Berlin, Düsseldorf, Zurich or Bern.

The central DACH specifics:

  • Higher data-protection expectations with EU AI Act duties in multiple stages from 2025
  • Clear expectation of EU data residency — no appetite for first-time US-cloud commitments with unclear data flows
  • Grown legacy systems (SAP, DATEV, BMD, Dynamics) rather than greenfield
  • Regulated industries with additional sectoral requirements (finance, health, public sector)

Funding landscape we accompany applications in pragmatically:

  • Austria: FFG (innovation vouchers, basic programmes), aws (digitalisation funding), Vienna Business Agency — discovery and pilot phases are eligible across several programmes
  • Germany: "Digital Jetzt" (BMWK) and ZIM (central innovation programme for SMEs)
  • Switzerland: Innosuisse programmes for applied AI research

Our inference containers, vector DBs and integration services run on Hetzner Frankfurt or Fly.io Frankfurt — EU data processing. For especially sensitive workloads we set up on-premise inference on customer infrastructure (Llama, Mistral, Qwen with vLLM or Ollama).

We implement technically; the legally binding GDPR / FADP / EU AI Act assessment is done by your lawyers or DPO — we own the technical architecture and provide the necessary templates cleanly documented. Consulting runs from Vienna; on-site meetings across DACH are included in the accompanying retainer from the larger package upwards.

FREQUENTLY ASKED

Frequently asked questions about AI consulting.

What does AI consulting at clickpuls cost?

We typically start with a fixed-price discovery workshop (2 half-days + roadmap). Effort and price depend on company size and the number of use cases to be assessed, and are quoted concretely after a free first call.

Model overview:

  • Discovery workshop: fixed price after the first call
  • Pilot implementations: separate fixed-price projects
  • Accompanying retainer: monthly from € 1,700 net, with a clearly defined hour quota in multiple package sizes

Mini packages we do not offer because they leave no room for clean eval maintenance or meaningful quarterly reviews. After the first call you receive a concrete fixed-price or retainer offer — binding and comparable.

Which tools and models do you use?

Model- and tool-agnostic — we choose per use case.

  • Frontier models: OpenAI GPT-5, Anthropic Claude Opus 4.7, Google Gemini 2.5 — when maximum reasoning depth is needed and the data flow is legally workable
  • EU / open-source models: Mistral, Llama, Qwen — when data residency is hard-required or volume makes self-hosting economical
  • Embeddings: OpenAI text-embedding-3, BGE-M3, Cohere
  • Vector DBs: pgvector in Postgres, Qdrant, Elasticsearch
  • Agent orchestration: native tool use of the models as default, LangGraph only when complexity justifies it

We build model-agnostic (abstracted via LiteLLM or own wrappers) — no vendor lock-in.

Is your AI consulting GDPR- and EU-AI-Act-compliant?

We cleanly implement the technical prerequisites — a blanket "compliant" guarantee is a legal assessment we do not give as a consulting and engineering team.

What we deliver technically:

  • EU hosting of applications and vector DBs (Hetzner, Fly.io, Frankfurt)
  • Logging discipline and PII masking before the model call
  • Data-processing agreements with all model providers
  • Clear training/inference separation
  • Classification of your use cases against EU AI Act risk tiers plus the technical building blocks for the respective duties

The legally binding assessment must be done separately by your lawyer or data protection officer — we deliver the technical template. The same applies analogously to FADP (Switzerland) and sector-specific rules (BSI-grundschutz, BaFin-MaRisk, etc.).

When does a frontier model make sense, when an open-source model?

We decide per use case based on requirement, volume and compliance — not by tool religion.

  • Frontier (OpenAI, Anthropic, Google) — when the task needs maximum reasoning depth (complex analysis, multi-step planning, code generation), token volume stays manageable (typically < 5–10 million tokens/month) and the data flow is legally workable
  • Open-source (Mistral, Llama, Qwen) — when data residency is hard-required (health, finance, public sector), volume makes self-hosting economical or model stability over years matters more than frontier reasoning

Hybrid setups are common: frontier model for heavy reasoning steps, open-source for high-volume routine tasks like classification or summarisation.

How do you handle hallucinations and quality assurance?

Three lines of defence — productive AI systems need clear guardrails:

  • Architecture choice — for fact-critical tasks always RAG with source attribution instead of a free LLM answer
  • Eval pipeline with a test set — before each release, the new model version or new prompt runs against an eval dataset with expected output, automatically scored
  • Human in the loop for sensitive actions — before the system writes into an existing system (ERP posting, outbound email, contract text), the action is submitted for approval

We don’t sell you the "agent does everything autonomously" fairy tale.

How long until a use case is productive?

Realistic time-to-productive by complexity:

  • First, clearly scoped use case: 4–8 weeks from discovery workshop to a productive pilot with eval pipeline and human-in-the-loop approval
  • More complex cases with deep ERP integration or multilingual roll-out: 8–16 weeks
  • Fully scaled multi-use-case programmes across several departments: 6–12 months, rolled out in waves

We deliver realistic estimates after discovery — no 2-week hype promises.

Do you also offer training and workshops for our teams?

Yes — as standalone fixed-price workshops (typically half-day or day) for power users, domain teams or leadership.

Content by audience:

  • Domain teams: prompt discipline and eval thinking
  • Leadership: architecture decisions and economics
  • Engineering teams: tool stack and integration

Workshops are not a sales vehicle for follow-on consulting — you can book a workshop and be done. On request, workshops are integrated into a broader consulting or implementation engagement.

What happens when a new frontier model is released?

A model switch is a config change with an eval run, not re-architecting — applications at clickpuls are built model-agnostic (abstracted via LiteLLM, OpenRouter or own wrappers).

In the accompanying retainer we evaluate new frontier releases (GPT-5.x, Claude 5, Gemini updates, Mistral releases) for relevant improvements to your use cases and propose a switch when quality, speed or cost meaningfully improve. You don’t sit on a model that’s outdated in two years.

What if our use case isn’t mature enough after your assessment?

We say so openly — and show what needs to happen first.

Typical preconditions:

  • Data consolidation
  • Defining a clear success metric
  • Smaller classic automation as a precursor

We don’t sell you a pilot that’s predictable to fail. Sometimes the honest recommendation is: "let’s talk again in 6 months once the following preconditions are met". The discovery investment is still valuable because you walk away with a clear roadmap and prioritised use-case list — useful for internal communication and budget planning.

Related services.

All services

Sounds like a match?

Request a free consultation
Get in touch