Multimodal Model - 搜索 News

2 天on MSN

China’s Moonshot releases a new open-source model Kimi K2.5 and a coding agent

The company said that the model was trained on 15 trillion mixed visual and text tokens.

2 天on MSN

Move over, Claude: Moonshot's new AI model lets you vibe-code from a single video upload

Move over, Claude: Moonshot's new AI model lets you vibe-code from a single video upload ...

15 天

New Apple model combines vision understanding and image generation with impressive results

Manzano combines visual understanding and text-to-image generation, while significantly reducing performance or quality trade-offs.

Business Wire

Ai2 Releases Molmo 2: State-of-the-Art Open Multimodal Family for Video and Multi-Image ...

SEATTLE--(BUSINESS WIRE)--Ai2 (The Allen Institute for AI) today announced Molmo 2, a state-of-the-art open multimodal model suite capable of precise spatial and temporal understanding of video, image ...

4 天

Talking to the Moon: World’s First Multimodal Foundation Model for Lunar Exploration and ...

The same AI methods that power ChatGPT can now allow you to talk to the Moon Its good to be skeptical when applying ...

SiliconANGLE

Mistral unveils Pixtral 12B, a multimodal AI model that can process both text and images

Mistral AI, a Paris-based artificial intelligence startup, today unveiled its latest advanced AI model capable of processing both images and text. The new model, called Pixtral 12B, employs about 12 ...

InfoWorld

Microsoft’s Phi-4-multimodal AI model handles speech, text, and video

Microsoft has introduced a new AI model that, it says, can process speech, vision, and text locally on-device using less compute capacity than previous models. Innovation in generative artificial ...

techtimes

Kling AI Unveils Unified Multimodal Video Model O1 and Video 2.6 to Reshape Creative Production

Kling AI, an AI-powered creative platform, is rolling out a suite of generative AI models designed to streamline how visual and audio content are made, a move that underscores the company's efforts to ...

Computerworld

OpenAI announces new multimodal desktop GPT with new voice and vision capabilities

OpenAI announced what it says is a vastly superior large language model capable of interacting with human-like speeds using text, voice, and visual prompts. But at least one analyst said the company ...

The Manila Times

GUI Model Second Only to Claude: MiningLamp Technology's AI-powered Global Marketing ...

On January 25th, the finals of the 3rd China's Innovation Challenge on Artificial Intelligence Application Scene (CICAS) ...

一些您可能无法访问的结果已被隐去。

显示无法访问的结果