Encoder Decoder Model

轻舟的VLA与世界模型架构解读

这张架构图展示的是轻舟智航下一代自动驾驶模型架构，核心理念是将 VLA（Vision-Language-Action，视觉-语言-动作模型）与 World Model（世界模型）融合到一个端到端（End-to-End）的系统中。

Google DeepMind Launches D4RT AI Model for Real-Time 4D Reconstruction

Google DeepMind has released D4RT, a unified AI model for 4D scene reconstruction that runs 18 to 300 times faster than ...

6 天

OCR迎来“闪电时刻”：LightOnOCR-2以1B模型击败9B竞品，开源即达SOTA！

最近， LightOn 在文档理解领域推出了名为 LightOnOCR-2-1B 的全新模型。这个模型仅用10亿的参数量，就在权威的 OCR 评测基准 OlmOCR-Bench ...

AV Network

ISE 2026 Product Watch: Pro AV Standards Are Set to Take over Barcelona

The Alliance for IP Media Solutions (AIMS) will mark a major milestone for Pro AV over IP at ISE 2026 with the official launch of Internet Protocol Me ...

8 天

Rethinking Long/Short Equity: Building On A Foundation Of Stocks, Not Cash

Amid rising interest in capital efficiency, WTLS Fund introduces S&P 500 exposure as the core of a long/short strategy, ...

EurekAlert!

Insilico Medicine launches science MMAI gym to train frontier LLMs into pharmaceutical ...

New “AI GYM for Science” dramatically boosts the biological and chemical intelligence of any causal or frontier LLM, ...

10 天

HOLO 微云全息基于 Masked 预训练 Transformer 的红外光谱反卷积算法

在 Transformer 架构的基础上，微云全息基于“Masked 预训练”策略。这种策略最初源于 BERT 模型在语言理解任务中的成功经验，被证明能够有效捕捉序列中元素间的深层次关系。微云全息研究团队将其迁移到红外光谱数据建模中，提出了一种自监督学习框架，用于从大规模无标签的红外光谱数据中自动学习鲁棒特征。

Scientific Research Publishing

Geo-Refined Point Transformer: Coordinate-Aware Excitation and Positional Upsampling for 3D ...

The proposed Coordinate-Aware Feature Excitation (CAFE) module and Position-Aware Upsampling (Pos-Up) module both adhere to ...

13 天

颠覆移动翻译：谷歌TranslateGemma如何用10亿参数撬动千亿市场

TranslateGemma的发布，标志着语言服务从“中心化云处理”迈入“分布式端侧智能”的新纪元。当1B参数的小模型在手机芯片上流畅运行，当斯瓦希里语的古老谚语通过6nm制程的NPU获得新生，我们见证的不仅是Natural Language ...

CNX Software

NanoPC-T6 Plus Rockchip RK3588 SBC switches from LPDDR4x to LPDDR5 (up to 32GB)

FriendlyELEC has launched the "Plus" variant of the NanoPC-T6 Rockchip RK3588 SBC using up to 32GB LPDDR5, instead of LPDDR4x RAM ...

一些您可能无法访问的结果已被隐去。

显示无法访问的结果