The multi-modal motion tracking visual acquisition system is a high-precision motion capture and analysis platform that integrates multiple sensing technologies. By combining vision sensors, inertial measurement units (IMUs) and other sensors, the system captures multi-dimensional data of human movement in real time for accurate motion analysis. It is widely used in sports training, rehabilitation, virtual reality, robot control and other fields, providing users with detailed sports data analysis and optimization suggestions. The system can not only capture local movements such as hands and feet, but also efficiently track the overall movement trajectory, help users improve sports performance, prevent sports injuries, and provide important data support for follow-up research and development.