| 170 | == Splitting == |
| 171 | |
| 172 | Sometimes it is necessary to split the data into several root-trees or ascii-files, for exmple to produce a test and trainings sample. For this, two options exist `--split-sequence` (shortcut `-S`) and `--split-quantile` (shortcut `-Q`). |
| 173 | |
| 174 | The first defines a fixed sequence, for example `-S 2 -S 1 -S 4` will write the first two events (2) to the first tree/file, the third event (1) to the second tree/file and the fourth to seventh (4) events to the third tree/file. To split even and odd events into two trees/files you have to use `-S 1 -S 1`. |
| 175 | |
| 176 | To randomly split the data use quantiles. For example, `-Q 0.5` splits the data equally into two samples, `-Q 0.2, -Q 0.5 -Q 0.9` splits the data into four samples of 10%, 20%, 30% and 40%. |
| 177 | |
| 178 | Note that splitting is defined on the rows which are received(!) from the database, i.e. before rows with NULL entries are excluded. |