Войти
  • 15219Просмотров
  • 3 месяца назадОпубликованоIlia

LLMs Meet Robotics: What Are Vision-Language-Action Models? (VLA Series Ep.1)

🤖 The first video in the series about Visual Language Action policies for robotics! If you've seen recent videos of robots folding laundry, washing dishes, or organizing entire rooms, they're likely powered by Vision Language Action Models (VLAs) - they type of robotics policies that use the power of retrained LLMs and Visual encoders to control real-life robots. 🎯 What You'll Learn in This Episode: ✅ What are VLAs? - The "LLMs for robots" explained simply ✅ Architecture Deep Dive - How text, vision, and actions combine ✅ Evolution from Single-Task - Why this approach is changing the pattern of policy training 🔥 Coming Up in This Series: - Pi-0 from Physical Intelligence - SmolVLA from HuggingFace LeRobot - Gr00t N1.5 from NVIDIA - Deep discussion about architecture - Overview of fine-tuning VLAs at home on cheap hardware like SO-ARM100 and LeKiwi - And other related topics ⚡ Key Takeaways: - VLAs are essentially LLMs adapted for robot control - They combine pre-trained vision and language understanding - Current models require fine-tuning, but already can show some generalization - We're moving from single-task to multi-task robot policies 🎯 Timestamps: 0:00 Intro 3:41 From LLMs → VLMs → VLAs 6:29 VLA architecture explained 16:47 Policy training patterns 30:03 Preparation for finetunning 34:04 Outro 🎬 Series Playlist: 🎬 Other related videos: - Short intro about SO-ARM: - Big video about SO-ARM and LeRobot: - Intro into LeKiwi: - Multimodality in robotics: - Robot MCP project: - ROS2 robot: 🔗 Resources & Code: - LeKiwi Robot: - SO-ARM Kit: - LeRobot Library: - My Dataset: - Pi-0 Paper: - SmolVLA: - Gr00t: 💬 Join the Discussion: What VLA-related topics do you want me to cover next? Drop your ideas below! I read every comment and often feature community suggestions in future videos. 🔔 Don't miss the next episode! Hit the bell icon - this series releases every few weeks with hands-on tutorials, code walkthroughs, and real robot experiments. ⭐ Found this helpful? Like, subscribe, and share with anyone building the future of robotics! #VLA #Robotics #AI #ArtificialIntelligence #MachineLearning #LLM #VLM #transformers #PhysicalIntelligence #HuggingFace #LeRobot #EmbodiedAI #RobotLearning