-
Notifications
You must be signed in to change notification settings - Fork 16
Documentation Addition: Training YAML Reference #835
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this should replace custom_dataset_conf
and advanced_base_config
, right?
``metatrain`` uses a YAML file to specify the parameters for model training, | ||
accessed via ``mtt train options.yaml``. In this section, we provide a complete reference | ||
for the parameters provided by the training YAML input. | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We could add something like
A `sample yaml file <XXX>` is available. This should be appropriate to start a first training. Edit it to suit your specific needs and desires.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
My overall idea of this file would be similar to the main option file of GROMACS:
https://manual.gromacs.org/current/user-guide/mdp-options.html
You can maybe scroll through for some inspiration. They have for example the parameters highlighted in red. We could do the same. As they are also using sphinx it should be possible :-)
|
||
Computational Parameters | ||
====================================== | ||
The computational parameters define the computational device, precision and seed. These parameters are optional. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The computational parameters define the computational device, precision and seed. These parameters are optional. | |
The computational parameters define the computational ``device``, ``precision`` and ``seed``. These parameters are optional. |
precision: 32 | ||
seed: 0 | ||
|
||
:param device [optional]: The computational device used for model training. The script automatically |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
:param device [optional]: The computational device used for model training. The script automatically | |
:param device [optional]: The computational device used for model training. The metatrain automatically |
``float16`` respectively. The datatypes that can be supported also depends on the model architecture used. | ||
:param seed [optional]: The seed used for non-deterministic operations and is used to set the seed for ``numpy.random``, | ||
``random``, ``torch`` and ``torch.cuda``. The input must be a non-negative integer. This parameter is important for ensuring | ||
reproducibility. If not specified, the seed is generated randomly. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
reproducibility. If not specified, the seed is generated randomly. | |
reproducibility. If not specified, the seed is generated randomly and reported in the log. |
The next set of parameters are also optional and deals with integration with Weights and Biases (wandb) logging. Leaving this | ||
section blank will simply disable wandb integration. The parameters for this section is the same as that in | ||
`wandb.init <https://docs.wandb.ai/ref/python/init/>`_. Here we provide a minimal example for the YAML input |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I thunk this can be shortened a bit maybe along the lines of:
The next set of parameters are also optional and deals with integration with Weights and Biases (wandb) logging. Leaving this | |
section blank will simply disable wandb integration. The parameters for this section is the same as that in | |
`wandb.init <https://docs.wandb.ai/ref/python/init/>`_. Here we provide a minimal example for the YAML input | |
Optional section dealing with integration with `Weights and Biases (wandb) <link>`_ logging. Leaving this | |
section blank will simply disable wandb integration. The parameters for this section is the same as that in | |
`wandb.init <https://docs.wandb.ai/ref/python/init/>`_. Here we provide a minimal example for the YAML input |
- tag2 | ||
notes: This is a test run | ||
|
||
All parameters of your options file will be automatically added to the wandb run so |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
All parameters of your options file will be automatically added to the wandb run so | |
All parameters of your ``options.yaml`` file will be automatically added to the wandb run so |
Loss | ||
=================== |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just to please my eyes
Loss | |
=================== | |
Loss | |
==== |
Added a reference page for the training YAML, I have yet to remove the redundant pages
📚 Documentation preview 📚: https://metatrain--835.org.readthedocs.build/en/835/