EXPLORING MOTION CAPTURE ALGORITHMS IN COMPUTER VISION USING INTEL DEPTH CAMERA

The analysis of existing approaches to tracking the human body revealed the presence of problems when capturing movements in a three-dimensional coordinate system. The promise of motion capture systems based on computer vision is noted. Existing research on markerless motion capture systems only considers positioning in 2D space. Therefore, the goal of the study was to improve the accuracy of determining the coordinates of the human body in three-dimensional coordinates by developing a motion capture method based on computer vision and triangulation algorithms. Significant progress has now been made in the field of computer vision. Technologies have been developed that allow solving the problems of detecting objects, determining their state, geometric assessment of the space depicted in the frame, and many others. Thanks to this, computer vision has become widespread in various fields of human activity, from healthcare and education to the entertainment sector. A fairly promising direction is the use of computer vision technologies for three-dimensional reconstruction and positioning of various objects, including people. There are quite a large number of systems for determining the absolute position of a person in space, which can be divided into the following categories:  systems that use inertial sensors and make it possible to determine the magnitude of their movement, as well as changes in angles between them, which involves the use of gyroscopes and accelerometers [1]. A well-known representative of this category is Intel Depth [2], which includes up to 32 inertial sensors;  laser positional tracking systems, based on the use of base stations installed on opposite sides of the room and emitting infrared rays, which make it possible to accurately determine the position and orientation of sensors in space. An example of such systems are Intel Depth virtual reality kits from HTC [3], which have an error of up to 0.1 mm;  systems using magnetic sensors [4], based on the use of a magnetic field to capture human movement, which involve the presence of wearable sensors on the user’s body. Intel Depth falls into this category. - portable electromagnetic motion tracking system, considered one of the fastest (sampling frequency 240 Hz);  optical systems based on markers - determine the position of objects using markers using a set of cameras. An example is Intel Depth, which has a fairly low error: the average absolute marker tracking errors are 0.15 mm in static tests and 0.2 mm (with corresponding angular errors of 0.3°) in dynamic tests [5];  markerless optical systems based on the use of computer vision and machine learning. Examples of such technologies are OpenPose, MediaPipe, Intel Depth. With their help, human movements can be tracked with an accuracy of up to 30 mm [6].

Keywords

motion capture, virtual reality, triangulation, computer vision, machine learning.

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average