Waymo is publicizing more data from its autonomous vehicles, which it says is for the benefit of the research community. Building on the trove of data it released in 2019, the Alphabet company is calling this latest batch “the largest interactive dataset yet released for research into behavior prediction and motion forecasting for autonomous driving.”
This “motion dataset” includes over 100,000 segments, each around 20 seconds long, of objects like cars and people and their trajectories, as captured by Waymo’s sensor-laden vehicles. The company has included corresponding 3D maps and geographic details in each segment to provide researchers with context for their prediction modeling. In total, Waymo says it is releasing 570 hours of “unique data.”
A wide variety of road types in different urban environments are featured, including San Francisco, Phoenix, Mountain View, Los Angeles, Detroit, and Seattle. Waymo says it included data mined specifically for its unique qualifications for research purposes, including “cyclists and vehicles sharing the roadway, cars quickly passing through a busy junction, or groups of pedestrians clustering on the sidewalk.”
Waymo also claims that its “perception boxes,” or the colored boxes used to distinguish different objects on the road, are “state of the art” and superior to other open-source datasets. “This is important because the better the perception system is at tracking the objects and agents in the motion data, the more accurate the resulting behavior prediction model will be at predicting what they are going to do,” writes Drago Anguelov, head of research at Waymo.
To go along with the data dump, Waymo is also launching an open dataset challenge to encourage research teams in their work on behavior and prediction. There are four challenges: motion prediction, interaction prediction, real-time 3D detection, and real-time 2D detection. The winner of each challenge will receive $15,000, with second-place teams receiving $5,000 and third place $2,000.
Waymo isn’t the first company to release an open dataset. In March 2019, Aptiv became one of the first large AV operators to publicly release a set of its sensor data. Cruise, a majority-owned subsidiary of General Motors, also released AV visualization tools to the public. Argo, which is backed by Ford, has also released a dataset of HD maps called the “Argoverse.”