« Translate sign language into spoken text with this AI model developed by Google. Improve the accessibility of video content and facilitate communication between deaf and hearing people »

AI about to hit the market and the latest advances
« Create and play fully AI-generated video games in real time. Modify worlds via text commands, enjoy photorealistic graphics and infinite experiences in every genre, from GTA to racing »
« Next-generation AI for creating interactive virtual worlds in real time at 720p and 24 fps. You can explore several minutes of coherent, modifiable scenes via text »
« Interactive, real-time, scalable video simulations: smooth at 1080p/60 fps, multimodal generation from images or text, and granular editing via prompts »
« Automate your web tasks directly in Chrome with AI that reads, clicks, and navigates websites for you. Summarize pages, draft emails, and manage your calendar via an integrated extension »
« Generate realistic videos or simulations of environments and agents: Dreamer4 uses deep reinforcement learning to explore and learn from new virtual worlds »
« Control an AI agent that learns, reasons, and plays live in virtual 3D worlds: SIMA2 includes natural language instructions, emojis, transferable concepts, and improves on its own thanks to Gemini. It collaborates, plans, and acts like a real teammate, even in unfamiliar games »
« Understand and edit your videos with precise spatio-temporal localization, object detection, and comprehensive editing instructions. The model analyzes long videos, answers your questions, and automates cropping, transitions, and narration »
« Run your GPU workloads in a confidential environment protected by Intel TDX (with cryptographic proof of integrity). Confidently deploy certified TDX images, encrypted storage, and hardware isolation »
« An AI architecture with integrated long-term memory, selective updating based on a surprise signal, and context extended beyond 2 million tokens. MIRAS unifies transformers and linear RNNs thanks to optimized associative memory »
« A 3D model that generates interactive 4D objects (cars with spinning wheels, automatic scripts) from textual instructions. Tokenization of 3D shapes, generation of shapes/scenes from text, and real-time character animation »
« This 15-billion-parameter video model (developed by Alibaba) took first place in the Artificial Analysis rankings for video generation, ahead of Seedance 2.0. It generates text-to-video and image-to-video clips in 1080p with lip-sync in seven languages, all within a single unified pipeline »