Update documentation
The following points should be covered in the documentation:
Sampling
-
There are three sampling methods. We need to explain each of them and their optimization strategies.
regular_raster
random_raster
random_matrix
-
Both
seed
forrandom_raster
andrandom_matrix
should be set to a different value in each run, such as,seed=as.integer(Sys.time())
, unless the user wants reproducible results. Results reproducibility is possible in two ways.- At a specific step.
- An entire classification run.
-
In case it is not possible to find models, increasing the number of
init.samples
is not always the solution. The user should also try to re-sample so a new set of sample points is picked.
Prediction
- For
randomForest
it is possible to set the number of trees, which should be 1/3 of the total number of predictors. For small values, below 100, the value should be odd so the models can be used by different predict functions, but also reproducible between runs. Check related issues for more information and to add support material.
Classification
-
Information to cover issue #57
-
Add information about issue #61 (closed)
Overall
- Describe what are the optimizations for the
optimized_mode
operation mode.
Edited by Romulo Pereira Goncalves