Capstone Project: Humanoid Robot in Action
Objective
Integrate all modules (Physical AI Foundations, ROS 2, Simulation, and VLA) into a single coherent system. The robot should be able to perceive the environment, interpret commands, and perform actions.
Components
- Voice Agent: Converts voice commands into text.
- VLA Module: Processes visual input and command text to generate robot actions.
- ROS 2 Nodes: Execute actions on the robot (simulation or real).
- Simulation Environment: Test robot behavior safely in Gazebo, Unity, or Isaac Sim.
Workflow
- User speaks a command.
- Voice agent converts speech to text.
- VLA module generates a structured robot command.
- ROS 2 nodes execute the command.
- Simulation environment visualizes the outcome.
Independent Test
- Give a voice command and verify the robot performs the intended action in simulation.