| # Dataset splits vs. filtered training / test splits |
|
|
| The NAVSIM framework utilizes several dataset splits for standardized training and evaluating agents. |
| All of them use the OpenScene dataset that is divided into the dataset splits `mini`,`trainval`,`test`,`private_test_e2e`, which can all be downloaded separately. |
|
|
| It is possible to run trainings and evaluations directly on these sets (see `Standard` in table below). |
| Alternatively, you can run trainings and evaluations on training and validation splits that were filtered for challenging scenarios (see `NAVSIM` in table below), which is the recommended option for producing comparable and competitive results efficiently. |
| In contrast to the dataset splits which refer to a downloadable set of logs, the training / test splits are implemented as scene filters, which define how scenes are extracted from these logs. |
|
|
| The NAVSIM training / test splits subsample the OpenScene dataset splits. |
| Moreover, the NAVSIM splits include overlapping scenes, while the Standard splits are non-overlapping. |
| Specifically, `navtrain` is based on the `trainval` data and `navtest` on the `test` data. |
|
|
| As the `trainval` sensor data is very large, we provide a separate download link, which loads only the frames needed for `navtrain`. |
| This eases access for users that only want to run the `navtrain` split and not the `trainval` split. If you already downloaded the full `trainval` sensor data, it is **not necessary** to download the `navtrain` frames as well. |
| The logs are always the complete dataset split. |
|
|
| ## Overview |
| The Table belows offers an overview on the training and test splits supported by NAVSIM. |
| In Navsim-v1.1, the training/test split can bet set with a single config parameter given in the table. |
|
|
| <table border="0"> |
| <tr> |
| <th></th> |
| <th>Name</th> |
| <th>Description</th> |
| <th>Logs</th> |
| <th>Sensors</th> |
| <th>Config parameters</th> |
| </tr> |
| <tr> |
| <td rowspan="3">Standard</td> |
| <td>trainval</td> |
| <td>Large split for training and validating agents with regular driving recordings. Corresponds to nuPlan and downsampled to 2HZ.</td> |
| <td>14GB</td> |
| <td>>2000GB</td> |
| <td> |
| train_test_split=trainval |
| </td> |
| </tr> |
| <tr> |
| <td>test</td> |
| <td>Small split for testing agents with regular driving recordings. Corresponds to nuPlan and downsampled to 2HZ.</td> |
| <td>1GB</td> |
| <td>217GB</td> |
| <td> |
| train_test_split=test |
| </td> |
| </tr> |
| <tr> |
| <td>mini</td> |
| <td>Demo split for with regular driving recordings. Corresponds to nuPlan and downsampled to 2HZ.</td> |
| <td>1GB</td> |
| <td>151GB</td> |
| <td> |
| train_test_split=mini |
| </td> |
| </tr> |
| <tr> |
| <td rowspan="2">NAVSIM</td> |
| <td>navtrain</td> |
| <td>Standard split for training agents in NAVSIM with non-trivial driving scenes. Sensors available separately in <a href="https://github.com/autonomousvision/navsim/blob/main/download/download_navtrain.sh">download_navtrain.sh</a>.</td> |
| <td>-</td> |
| <td>445GB*</td> |
| <td> |
| train_test_split=navtrain |
| </td> |
| </tr> |
| <tr> |
| <td>navtest</td> |
| <td>Standard split for testing agents in NAVSIM with non-trivial driving scenes. Available as a filter for test split.</td> |
| <td>-</td> |
| <td>-</td> |
| <td> |
| train_test_split=navtest |
| </td> |
| </tr> |
| <tr> |
| <td rowspan="2">Competition</td> |
| <td>warmup_test_e2e</td> |
| <td>Warmup test split to validate submission on hugging face. Available as a filter for mini split.</td> |
| <td>-</td> |
| <td>-</td> |
| <td> |
| train_test_split=warmup_test_e2e |
| </td> |
| </tr> |
| <tr> |
| <td>private_test_e2e</td> |
| <td>Private test split for the challenge leaderboard on hugging face.</td> |
| <td><1GB</td> |
| <td>25GB</td> |
| <td> |
| train_test_split=private_test_e2e |
| </td> |
| </tr> |
| </table> |
| |
| (*300GB without history) |
| |
| ## Splits |
| |
| The standard splits `trainval`, `test`, and `mini` are from the OpenScene dataset. Note that the data corresponds to the nuPlan dataset with a lower frequency of 2Hz. You can download all standard splits over Hugging Face with the bash scripts in [download](../download) |
| |
| NAVSIM provides a subset and filter of the `trainval` split, called `navtrain`. The `navtrain` split facilitates a standardized training scheme and requires significantly less sensor data storage than `travel` (445GB vs. 2100GB). If your agents don't need historical sensor inputs, you can download `navtrain` without history, which requires 300GB of storage. Note that `navtrain` can be downloaded separately via [download_navtrain.sh](https://github.com/autonomousvision/navsim/blob/main/download/download_navtrain.sh) but still requires access to the `trainval` logs. Similarly, the `navtest` split enables a standardized set for testing agents with a provided scene filter. Both `navtrain` and `navtest` are filtered to increase interesting samples in the sets. |
| |
| For the challenge on Hugging Face, we provide the `warmup_test_e2e` and `private_test_e2e` for the warm-up and challenge track, respectively. Note that `private_test_e2e` requires you to download the data, while `warmup_test_e2e` is a scene filter for the `mini` split. |
| |