2025.04.07

When large AI models no longer rely on the cloud and compute sinks into every embedded device, Rockchip’s RK1820 coprocessor is turning “on-device intelligence” into reality.

The RK1820 is Rockchip’s add-on accelerator for flagship SoCs such as RK3576/RK3588. With advanced packaging, high performance, low power and multi-modal capability, it brings robust support for deploying big models at the edge.

RK1820 at a glance

Model support: up to 3 B/7 B-parameter on-device LLMs (16 K context)

Performance: > 100 tokens/s generation, < 0.1 s end-to-end latency

Modalities: text, voice, image, video; CNN compatible

Host links: PCIe 2.0 / USB 3.0; plug-and-play with RK3576, RK3588

Software: HuggingFace, PyTorch, GGUF; OpenAI-style API; C/Python bindings

Its high-bandwidth + low-power design breaks the energy/latency barrier for edge LLMs and delivers cloud-class responsiveness on site.

3-D TSV stack

Vertical die interconnect boosts bandwidth 10×, cuts power 30 % and halves footprint, integrating logic, memory and sensors in one package without adding volume.

About 3 B & 7 B models

3 B (30 B params): lightweight yet capable, runs on phones/edge boxes for chat, summarisation, coding, QA, translation, extraction—near 10 B-class quality offline.

7 B (70 B params): server/IPC/high-end laptop grade, handles long-doc summary, logical reasoning, code-gen, multi-turn dialogue, multimodal fusion—cloud-grade depth without the cloud.

Benchmarks

(see table in original)

Three key advantages

Host-ecology ready – PCIe/USB link needs no BSP change; works out-of-the-box on RK3568/RK3576/RK3588.

Partitioned compute – host runs OS/UI/I/O; RK1820 runs LLM, vision, semantics. Shared cache + high-speed bus isolate tasks and save power.

Independent upgrade – coprocessor evolves separately. Next-gen RK1860 will deliver > 64 TOPS and 13 B-model support at > 1 TB/s bandwidth, filling the domestic high-end gap.

Deployment snapshots

Education tablet – offline “AI teacher” with Qwen 3 B/7 B for spoken-English scoring, essay correction and tutoring without networking.

Auto cockpit – RK3588 + RK1820 supports 10+ concurrent voice agents, eliminating cloud latency for in-car multi-role dialogue.

Robotics – Qwen2.5-3 B emotion model on RK1820 gives low-power speech, sentiment and vision understanding.

Enterprise AI terminals – legacy RK3568/RK3399 boxes gain 100 token/s AI via USB/PCIe plug-in, adding ASR, image retrieval and text generation instantly.

Previous：Domestic AI chip evolves again! Complete analysis of RK3576 System On Module + Development board Next：AI Compute Is More Than the Main SoC! A Clear Look at RK1820's Real Role in the RK3588 System

Return

NEWS

2025.04.07

Enterprise Open Source Hardware Platform

Contact us for cooperation.