Kairós in action
Edge demos powered by the RaiderChip NPU — voice, text, and vision models running locally in real time.
Kairós ASIC Prototype Demonstrator
Edge demos powered by the RaiderChip NPU — voice, text, and vision models running locally in real time.
Kairós ASIC Prototype Demonstrator
- No latency | - No network |
|
- Maximum performance | - Ultra-low power consumption |
More Tokens, less resources: 🔗Efficiency→ 4.4x NVIDIA Jetson Orin Nano Super
|
TSMC 7nm FinFet128 bits LPDDR5X MemoryEmbedded RISC-V CPU>2 TFLOPS FP32 precision10 Watts max |
Sampling 2027 |
Plug & Play simplicity:SPI + USB onboardDesigned for instant setup:from boot to prototyping, everything works right out of the box |
Model |
Input Prompt Processing |
Max Speed per User |
Max Speed per User |
|---|---|---|---|
Meta Llama 2 7B |
142.5 | 11.16 | 36.36 |
Meta Llama 3.18B |
145.38 | 10.02 | 34.32 |
Meta Llama 3.21B |
817.5 | 60.0 | 193.14 |
Meta Llama 3.23B |
330.12 | 23.28 | 78.72 |
Google Gemma 31B |
65.22 | ||
Alibaba Qwen 2.5 Coder1.5B |
506.76 | 46.14 | 128.22 |
Alibaba Qwen 332B |
7.92 |
||
Alibaba Qwen 314B |
17.94 |
||
Alibaba Qwen 38B |
134.88 | 9.9 | 33.66 |
Alibaba Qwen 34B |
56.4 |
||
Alibaba Qwen 31.7B |
509.1 | 42.72 | 139.44 |
Alibaba Qwen 30.6B |
951.78 | 118.38 | 341.76 |
Microsoft Phi 22.7B |
24.12 | 54.66 |
|
Microsoft Phi 3 mini4B |
19.62 | ||
Microsoft Phi 4 mini4B |
18.18 | ||
TII Falcon 31B |
776.34 | 53.16 | 178.44 |
Fraunhofer Teuken 7B |
155.04 | 9.9 | |
DeepSeek R1 Distill Llama8B |
145.68 | 10.02 | 34.32 |
DeepSeek R1 Distill Qwen14B |
69.42 | 5.34 | 17.76 |
DeepSeek R1 Distill Qwen1.5B |
509.64 | 46.14 | |
DeepSeek R1 0528 Qwen 38B |
134.82 | 9.9 | 33.66 |
DeepCoder Preview14B |
17.76 |
||
OpenAI Whisper Small |
311.4 | ||
Vyvo-TTS0.6B |
951.78 | 118.38 | 341.76 |
Moondream 2 2B |
28 | ||
FlowTransformer |
49K |
-Scalable by design-Kairós is built to grow with youStack as many units as your solution requires.No complexity. Just the performance you choose, when you need it. |
An intelligent assistant always available on board
Thanks to its fully offline operation, guarantees reliable availability even in isolated areas without network coverage.
Privacy without compromise.
A truly offline smart home.
Enjoy the full power of Generative AI, protecting the privacy of the ones you love most.
Monitor, diagnose, and act in real time without sending data outside your facility.
Local processing. Instant decisions.
No latency, no unnecessary data traffic.
No connection. No compromise on privacy.
Educational materials and smart toys with minimal power consumption and fully local operation.