Downloads provided by UsageCounts
EOAD is a collection of videos captured by wearable cameras, mostly of sports activities. It contains both visual and audio modalities. It was initiated by the HUJI and FPVSum egocentric activity datasets. However, the number of samples and diversity of activities for HUJI and FPVSum were insufficient. Therefore, we combined these datasets and populated them with new YouTube videos. The selection of videos was based on the following criteria: The videos should not include text overlays. The videos should contain natural sound (no external music) The actions in videos should be continuous (no cutting the scene or jumping in time) Video samples were trimmed depending on scene changes for long videos (such as driving, scuba diving, and cycling). As a result, a video may have several clips depicting egocentric actions. Hence, video clips were extracted from carefully defined time intervals within videos. The final dataset includes video clips with a single action and natural audio information. Statistics for EOAD: 30 activities 303 distinct videos 1392 video clips 2243 minutes labelled videos clips The detailed statistics for the selected datasets and the crawled videos clips from YouTube are given below: HUJI: 49 distinct videos - 148 video clips for 9 activities (driving, biking, motorcycle, walking, boxing, horse riding, running, skiing, stair climbing) FPVSum: 39 distinct videos - 124 video segments for 8 activities (biking, horse riding, skiing, longboarding, rock climbing, scuba, skateboarding, surfing) YouTube: 216 distinct videos - 1120 video clips for 27 activities (american football, basketball, bungee jumping, driving, go-kart, horse riding, ice hockey, jet ski, kayaking, kitesurfing, longboarding, motorcycle, paintball, paragliding, rafting, rock climbing, rowing, running, sailing, scuba diving, skateboarding, soccer, stair climbing, surfing, tennis, volleyball, walking) The video clips used for training, validation, and test sets for each activity are listed in Table 1. Multiple video clips may belong to a single video because of trimming it for some reasons (i.e., scene cut, temporary overlayed text on videos, or video parts unrelated to activities). While splitting the dataset, the minimum number of videos for each activity was selected as 8. Additionally, the video samples were divided as 50%, 25%, and 25% for training (minimum four videos), validation (minimum two videos), and testing (minimum two videos), respectively. On the other hand, videos were split according to the raw video footage to prevent the mixing of similar video clips (having the same actors and scenes) into training, validation, and test sets. Therefore, we ensured that the video clips trimmed from the same videos were split together into training, validation, or test sets to satisfy a fair comparison. Some activities have continuity throughout the video, such as scuba, longboarding, or riding horse which also have an equal number of video segments with the number of videos. However, some activities, such as skating, occurred in a short time, making the number of video segments higher than the others. As a result, the number of video clips for training, validation, and test sets was highly imbalanced for the selected activities (i.e., jet ski and rafting have 4; however, soccer has 99 video clips for training). Dataset splitting for EOAD Train Validation Test Action Label #Clips Total Duration #Clips Total Duration #Clips Total Duration AmericanFootball 34 00:06:09 36 00:05:03 9 00:01:20 Basketball 43 01:13:22 19 00:08:13 10 00:28:46 Biking 9 01:58:01 6 00:32:22 11 00:36:16 Boxing 7 00:24:54 11 00:14:14 5 00:17:30 BungeeJumping 7 00:02:22 4 00:01:36 4 00:01:31 Driving 19 00:37:23 9 00:24:46 9 00:29:23 GoKart 5 00:40:00 3 00:11:46 3 00:19:46 Horseback 5 01:15:14 5 01:02:26 2 00:20:38 IceHockey 52 00:19:22 46 00:20:34 10 00:36:59 Jetski 4 00:23:35 5 00:18:42 6 00:02:43 Kayaking 28 00:43:11 22 00:14:23 4 00:11:05 Kitesurfing 30 00:21:51 17 00:05:38 6 00:01:32 Longboarding 5 00:15:40 4 00:18:03 4 00:09:11 Motorcycle 20 00:49:38 21 00:13:53 8 00:20:30 Paintball 7 00:33:52 4 00:12:08 4 00:08:52 Paragliding 11 00:28:42 4 00:10:16 4 00:19:50 Rafting 4 00:15:41 3 00:07:27 3 00:06:13 RockClimbing 6 00:49:38 2 00:21:59 2 00:18:50 Rowing 5 00:47:05 3 00:13:21 3 00:03:26 Running 21 01:21:56 19 00:46:29 11 00:42:59 Sailing 7 00:39:30 4 00:14:39 6 00:15:43 Scuba 5 00:35:02 3 00:23:43 2 00:18:52 Skate 91 00:15:53 30 00:07:01 10 00:02:03 Ski 14 01:48:15 17 01:01:59 7 00:39:15 Soccer 102 00:48:39 52 00:13:17 16 00:06:54 StairClimbing 6 01:05:32 6 00:17:18 5 00:20:22 Surfing 23 00:12:51 17 00:06:52 10 00:07:04 Tennis 34 00:27:04 9 00:06:03 9 00:03:14 Volleyball 87 00:19:14 35 00:07:46 7 00:18:58 Walking 49 00:43:02 36 00:38:25 10 00:10:23 Total 30 740 20:22:37 452 09:20:23 200 08:00:08 EOAD Code Repository Scripts for downloading raw videos and trim them in to video clips are provided in this GitHub repository. Regarding the questions, please contact mali.arabaci@gmail.com.
multi-modality, first-person vision, egocentric activity recognition, multi-modality, first-person vision, egocentric activity recognition
multi-modality, first-person vision, egocentric activity recognition, multi-modality, first-person vision, egocentric activity recognition
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 60 | |
| downloads | 16 |

Views provided by UsageCounts
Downloads provided by UsageCounts