Skip to content

Conversation

HowWeiBin
Copy link

@HowWeiBin HowWeiBin commented Oct 16, 2025

Added a reference page for the training YAML, I have yet to remove the redundant pages


📚 Documentation preview 📚: https://metatrain--835.org.readthedocs.build/en/835/

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this should replace custom_dataset_conf and advanced_base_config, right?

``metatrain`` uses a YAML file to specify the parameters for model training,
accessed via ``mtt train options.yaml``. In this section, we provide a complete reference
for the parameters provided by the training YAML input.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We could add something like

A `sample yaml file <XXX>`  is available. This should be appropriate to start a first training. Edit it to suit your specific needs and desires.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

My overall idea of this file would be similar to the main option file of GROMACS:

https://manual.gromacs.org/current/user-guide/mdp-options.html

You can maybe scroll through for some inspiration. They have for example the parameters highlighted in red. We could do the same. As they are also using sphinx it should be possible :-)


Computational Parameters
======================================
The computational parameters define the computational device, precision and seed. These parameters are optional.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
The computational parameters define the computational device, precision and seed. These parameters are optional.
The computational parameters define the computational ``device``, ``precision`` and ``seed``. These parameters are optional.

precision: 32
seed: 0

:param device [optional]: The computational device used for model training. The script automatically
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
:param device [optional]: The computational device used for model training. The script automatically
:param device [optional]: The computational device used for model training. The metatrain automatically

``float16`` respectively. The datatypes that can be supported also depends on the model architecture used.
:param seed [optional]: The seed used for non-deterministic operations and is used to set the seed for ``numpy.random``,
``random``, ``torch`` and ``torch.cuda``. The input must be a non-negative integer. This parameter is important for ensuring
reproducibility. If not specified, the seed is generated randomly.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
reproducibility. If not specified, the seed is generated randomly.
reproducibility. If not specified, the seed is generated randomly and reported in the log.

Comment on lines +42 to +44
The next set of parameters are also optional and deals with integration with Weights and Biases (wandb) logging. Leaving this
section blank will simply disable wandb integration. The parameters for this section is the same as that in
`wandb.init <https://docs.wandb.ai/ref/python/init/>`_. Here we provide a minimal example for the YAML input
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I thunk this can be shortened a bit maybe along the lines of:

Suggested change
The next set of parameters are also optional and deals with integration with Weights and Biases (wandb) logging. Leaving this
section blank will simply disable wandb integration. The parameters for this section is the same as that in
`wandb.init <https://docs.wandb.ai/ref/python/init/>`_. Here we provide a minimal example for the YAML input
Optional section dealing with integration with `Weights and Biases (wandb) <link>`_ logging. Leaving this
section blank will simply disable wandb integration. The parameters for this section is the same as that in
`wandb.init <https://docs.wandb.ai/ref/python/init/>`_. Here we provide a minimal example for the YAML input

- tag2
notes: This is a test run

All parameters of your options file will be automatically added to the wandb run so
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
All parameters of your options file will be automatically added to the wandb run so
All parameters of your ``options.yaml`` file will be automatically added to the wandb run so

Comment on lines +73 to +74
Loss
===================
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just to please my eyes

Suggested change
Loss
===================
Loss
====

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants