Reference Architecture

The architecture, in one line

In GenAIoT, the “model” is not the system. The system is model + context + tools + controls.

GenAIoT systems connect real-world signals to real-world outcomes — safely — by combining edge and cloud intelligence with a context layer, retrieval, governed tool use, and end-to-end observability.

Figure: GenAIoT reference architecture. A GenAIoT system spans Edge → Platform → Enterprise over two planes: a data plane that carries signals and retrieved context into model reasoning, and a control plane that enforces policy, approvals, safety envelopes, and auditability across tool execution and actions.

The 6 MVP building blocks

01. Data & signals

This is the system’s “ground truth” — high-volume, high-variance inputs such as:

  • Telemetry and events (time-series, logs, alarms)
  • Video and audio streams (where applicable)
  • Location, environment, and network state
  • Enterprise operational signals (tickets, work orders, parts, schedules)
Design note: IoT is temporal. You need time-windowing, baselines, anomaly and change-point awareness — not just raw ingestion.
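
Illustrative sketch (Python): one way to express "not just raw ingestion," with a rolling window, a baseline, and a simple deviation check over a telemetry stream. The window size and threshold below are placeholder assumptions, not recommendations.

    from collections import deque
    from statistics import mean, stdev

    class RollingBaseline:
        """Keep a rolling window over a telemetry signal and flag readings
        that deviate strongly from the recent baseline."""

        def __init__(self, window: int = 288, z_threshold: float = 4.0):
            self.window = deque(maxlen=window)   # e.g. 288 samples = 24 h at a 5-minute cadence
            self.z_threshold = z_threshold

        def update(self, value: float) -> dict:
            baseline = mean(self.window) if len(self.window) >= 2 else value
            spread = stdev(self.window) if len(self.window) >= 2 else 0.0
            z = (value - baseline) / spread if spread > 0 else 0.0
            self.window.append(value)
            return {
                "value": value,
                "baseline": round(baseline, 3),
                "z_score": round(z, 2),
                "anomalous": abs(z) >= self.z_threshold,
            }

    # Feed time-ordered samples; downstream reasoning then sees a derived
    # signal (baseline + deviation), not just the raw reading.
    monitor = RollingBaseline()
    for reading in (20.1, 20.3, 19.9, 20.2, 35.7):
        print(monitor.update(reading))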

 

02. Context layer

Context turns data into meaning. This layer provides structure and constraints, such as:

  • Asset models (identity, hierarchy, relationships, topology)
  • Digital twins (state + constraints over time)
  • Semantics/ontology (consistent vocabulary across systems)
  • Feature store and derived signals (shared, reusable inputs)
Outcome: Models and tools can reason with “operational truth,” not isolated metrics.
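
Illustrative sketch (Python): one way an asset model and a slice of twin state can be represented so that constraint checks become trivial. The asset names, fields, and limits below are hypothetical, not a standard schema.

    from dataclasses import dataclass, field

    @dataclass
    class Asset:
        """Identity, hierarchy and relationships for one physical asset."""
        asset_id: str
        asset_type: str                 # term from the shared ontology, e.g. "pump"
        parent_id: str | None = None
        related_ids: list[str] = field(default_factory=list)

    @dataclass
    class TwinState:
        """Current derived readings plus operating constraints for an asset."""
        asset_id: str
        readings: dict[str, float] = field(default_factory=dict)
        limits: dict[str, tuple[float, float]] = field(default_factory=dict)

        def violations(self) -> list[str]:
            out = []
            for signal, (low, high) in self.limits.items():
                value = self.readings.get(signal)
                if value is not None and not (low <= value <= high):
                    out.append(f"{signal}={value} outside [{low}, {high}]")
            return out

    # Hypothetical example: a pump attached to a production line.
    pump = Asset("pump-017", "pump", parent_id="line-3")
    twin = TwinState("pump-017",
                     readings={"bearing_temp_c": 92.0},
                     limits={"bearing_temp_c": (10.0, 85.0)})
    print(twin.violations())   # ['bearing_temp_c=92.0 outside [10.0, 85.0]']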

03. Retrieval (time-series + knowledge)

GenAIoT relies on retrieval to ground outputs and reduce hallucinations:

  • Time-series retrieval: baselines, historical windows, similar incidents, correlated signals
  • Document retrieval: manuals, SOPs, tickets, runbooks, safety policies, vendor notes
  • Hybrid retrieval: merge telemetry context with text sources and rank by relevance/recency
Best practice: require citations/provenance for any recommendation that influences action.
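
Illustrative sketch (Python): a simple hybrid ranking step that blends retriever relevance with recency and keeps a source reference on every snippet so downstream recommendations can cite their provenance. The weights and half-life are placeholder assumptions.

    from dataclasses import dataclass
    from datetime import datetime, timezone

    @dataclass
    class Snippet:
        text: str
        source: str          # provenance: document id, ticket id, or telemetry query
        relevance: float     # 0..1 from the underlying retriever (vector, keyword, TSDB)
        timestamp: datetime  # timezone-aware

    def hybrid_rank(snippets: list[Snippet],
                    recency_half_life_days: float = 90.0,
                    recency_weight: float = 0.3) -> list[Snippet]:
        """Blend relevance with an exponential recency decay and sort best-first."""
        now = datetime.now(timezone.utc)

        def score(s: Snippet) -> float:
            age_days = max((now - s.timestamp).total_seconds() / 86400.0, 0.0)
            recency = 0.5 ** (age_days / recency_half_life_days)
            return (1 - recency_weight) * s.relevance + recency_weight * recency

        return sorted(snippets, key=score, reverse=True)

    # Every ranked Snippet carries `source`, so a recommendation can attach
    # citations before it is allowed to influence an action.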

 

04. Model layer (routing + deployment posture)

Different tasks require different models and placements:

  • Routing: select models by latency, cost, privacy, and reliability targets
  • On-device vs cloud: edge inference for low latency/resilience; cloud for heavy reasoning
  • Small vs large models: smaller models for classification/extraction; larger for synthesis/planning
  • Guarded generation: constrained outputs, deterministic checks, and evaluation gates
Rule of thumb: don’t pick “the best model.” Pick the best system behavior for the task.
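
Illustrative sketch (Python): a small routing table plus a selector that filters by latency and privacy targets and then takes the cheapest remaining option. The model names, latencies, and costs are placeholders, not recommendations.

    from dataclasses import dataclass

    @dataclass
    class ModelProfile:
        name: str
        placement: str                  # "edge" or "cloud"
        p95_latency_ms: int
        cost_per_call: float
        approved_for_sensitive_data: bool

    CATALOG = [
        ModelProfile("small-classifier", "edge", 40, 0.0001, True),
        ModelProfile("mid-generalist", "cloud", 400, 0.002, False),
        ModelProfile("large-planner", "cloud", 2500, 0.02, False),
    ]

    def route(task: str, max_latency_ms: int, data_is_sensitive: bool) -> ModelProfile:
        """Pick the cheapest model that meets the task's latency and privacy targets."""
        needs_planning = task in {"synthesis", "planning"}
        candidates = [
            m for m in CATALOG
            if m.p95_latency_ms <= max_latency_ms
            and (not data_is_sensitive or m.approved_for_sensitive_data)
            and (not needs_planning or m.name == "large-planner")
        ]
        if not candidates:
            raise RuntimeError("no model meets the targets; degrade gracefully or escalate")
        return min(candidates, key=lambda m: m.cost_per_call)

    print(route("classification", max_latency_ms=100, data_is_sensitive=True).name)
    # -> small-classifier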

 

05. Tools/agents (execution and orchestration)

This is where GenAIoT moves from insight to operational impact:

  • Diagnostics and guided troubleshooting
  • Work order creation and triage (ITSM / CMMS)
  • Scheduling, dispatch, parts checks, escalation workflows
  • Safe configuration changes (where permitted)
  • Integrations with ERP/CMMS/SCADA/field apps
Key requirement: tool use must be governed (policy checks, approvals, rollback).
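
Illustrative sketch (Python): the shape of a governed tool call, where every execution passes a permission check, may require human approval, is audit-logged, and is rolled back on failure. The tool names and policy rules are hypothetical.

    from dataclasses import dataclass, field
    from typing import Callable

    @dataclass
    class ToolCall:
        tool: str
        args: dict
        requested_by: str        # agent or user identity, for the audit trail

    @dataclass
    class Governor:
        allowed_tools: set[str]
        needs_approval: set[str]                    # tools that require human sign-off
        approve: Callable[[ToolCall], bool]         # e.g. pages an operator, returns the decision
        audit_log: list[dict] = field(default_factory=list)

        def execute(self, call: ToolCall,
                    run: Callable[[dict], dict],
                    rollback: Callable[[dict], None]) -> dict:
            if call.tool not in self.allowed_tools:
                raise PermissionError(f"{call.tool} is not permitted for this agent")
            if call.tool in self.needs_approval and not self.approve(call):
                self.audit_log.append({"call": call, "status": "rejected"})
                return {"status": "rejected"}
            result = run(call.args)
            self.audit_log.append({"call": call, "status": "executed", "result": result})
            if result.get("error"):
                rollback(call.args)                 # undo rather than leave a change half-applied
                self.audit_log.append({"call": call, "status": "rolled_back"})
            return result

A hypothetical instantiation might allow work-order creation freely while requiring approval only for configuration changes.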

 

06. Governance & observability

This is what makes GenAIoT deployable at scale:

  • Evals & testing: golden sets, scenario tests, regression, red-teaming
  • Audit logs: who/what/when/why + source citations + tool calls + outcomes
  • Safety envelopes: allowed actions, thresholds, approvals, rollback conditions
  • Prompt/tool governance: prompt registry, tool permissions (RBAC/ABAC), rate limits
  • Monitoring: drift, hallucination indicators, failure modes, cost, latency, reliability
Principle: If you can’t observe it, you can’t govern it. If you can’t govern it, you can’t scale it.
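
Illustrative sketch (Python): one possible shape for the audit record (who/what/when/why plus citations, tool calls, and outcome) and a safety-envelope check. The field names and bounds are illustrative assumptions, not a standard.

    from dataclasses import dataclass, field
    from datetime import datetime, timezone

    @dataclass
    class AuditRecord:
        """One end-to-end trace entry: who/what/when/why, evidence, and outcome."""
        actor: str                       # agent or user id
        action: str                      # e.g. "recommendation" or "tool_call"
        rationale: str                   # why the system acted
        citations: list[str]             # provenance for claims that influenced the action
        tool_calls: list[dict] = field(default_factory=list)
        outcome: str = "pending"
        timestamp: datetime = field(default_factory=lambda: datetime.now(timezone.utc))

    @dataclass
    class SafetyEnvelope:
        """Allowed actions and numeric bounds; anything outside is blocked or escalated."""
        allowed_actions: set[str]
        bounds: dict[str, tuple[float, float]]    # e.g. {"setpoint_delta_pct": (-5.0, 5.0)}

        def permits(self, action: str, params: dict) -> bool:
            if action not in self.allowed_actions:
                return False
            # A parameter missing from `params` is treated as unconstrained in this sketch.
            return all(low <= params.get(name, low) <= high
                       for name, (low, high) in self.bounds.items())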

 

Data plane vs Control plane

Data plane

Signals + context + retrieval that feed reasoning and recommendations.

Control plane

Policies + approvals + audits + evals that constrain and validate actions.

Design guidance: a GenAIoT deployment succeeds when both planes are designed together.
Most failures happen when teams build the data plane first and try to “bolt on” governance later.

Deployment patterns (MVP list)

Pattern A: RAG Copilot for Operations

Use when: you need faster troubleshooting and higher-quality decisions.
Inputs: telemetry + tickets + manuals
Output: grounded explanations + recommended next steps with citations
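
Illustrative sketch (Python): how Pattern A can assemble a grounded request and enforce the "citations required" rule. The prompt wording and the call_llm parameter are placeholders for whatever model interface is in use.

    def grounded_answer(question: str, telemetry_summary: str,
                        snippets: list[dict], call_llm) -> dict:
        """snippets: [{"source": "manual-EN-42 sec 3.1", "text": "..."}, ...]"""
        evidence = "\n".join(f"[{i + 1}] ({s['source']}) {s['text']}"
                             for i, s in enumerate(snippets))
        prompt = (
            "You are an operations copilot. Answer using ONLY the evidence below "
            "and cite sources as [n].\n\n"
            f"Telemetry summary:\n{telemetry_summary}\n\n"
            f"Evidence:\n{evidence}\n\n"
            f"Question: {question}"
        )
        answer = call_llm(prompt)
        cited = [s["source"] for i, s in enumerate(snippets) if f"[{i + 1}]" in answer]
        if not cited:
            # No grounding means no recommendation: escalate instead of guessing.
            return {"answer": None, "reason": "no citations; escalate to a human"}
        return {"answer": answer, "citations": cited}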

Pattern B: Guided Automation (Human-in-the-loop)

Use when: actions impact reliability, safety, or cost.
Tools: work orders, scheduling, diagnostics
Controls: approvals + policy gates + rollback

Pattern C: Edge-first Intelligence

Use when: latency, privacy, or resilience dominates.
Edge: local inference + local context
Cloud: periodic learning, heavier analysis, fleet-wide insights
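
Illustrative sketch (Python): the edge-first decision in Pattern C, serving locally when the on-device answer is fast and confident enough, falling back to the cloud otherwise, and degrading to the local answer if connectivity fails. The local_infer and cloud_infer parameters and the thresholds are placeholders.

    import time

    def edge_first(task: dict, local_infer, cloud_infer,
                   latency_budget_s: float = 0.2,
                   confidence_floor: float = 0.8) -> dict:
        """Serve from the edge when it is fast and confident; otherwise fall back."""
        start = time.monotonic()
        local = local_infer(task)                   # on-device inference
        elapsed = time.monotonic() - start
        if elapsed <= latency_budget_s and local.get("confidence", 0.0) >= confidence_floor:
            return {"answer": local, "served_by": "edge", "latency_s": round(elapsed, 3)}
        try:
            return {"answer": cloud_infer(task), "served_by": "cloud"}
        except ConnectionError:
            # Resilience: a degraded local answer beats no answer when connectivity fails.
            return {"answer": local, "served_by": "edge-degraded"}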

Quick checklist for teams adopting the reference architecture

  • Do we have an asset model or twin to anchor “context”?
  • Can we retrieve both time-series history and operational documents?
  • Do we route models by task/latency/cost rather than “one model everywhere”?
  • Are tool calls gated by policy and approvals with rollback?
  • Do we log provenance and outcomes end-to-end?
  • Do we have evals that reflect real operating scenarios?