Documentation Addition: Training YAML Reference #835

HowWeiBin · 2025-10-16T13:43:27Z

Added a reference page for the training YAML. I have yet to remove the redundant pages.

📚 Documentation preview 📚: https://metatrain--835.org.readthedocs.build/en/835/

PicoCentauri · 2025-10-17T09:20:03Z

docs/src/getting-started/train_yaml_config.rst

I think this should replace custom_dataset_conf and advanced_base_config, right?

PicoCentauri · 2025-10-17T09:21:33Z

docs/src/getting-started/train_yaml_config.rst

+``metatrain`` uses a YAML file to specify the parameters for model training,
+accessed via ``mtt train options.yaml``. In this section, we provide a complete reference
+for the parameters provided by the training YAML input.
+


We could add something like

A `sample YAML file <XXX>`_ is available. It should be a suitable starting point for a first training run. Edit it to suit your specific needs.

PicoCentauri · 2025-10-17T09:23:11Z

docs/src/getting-started/train_yaml_config.rst

My overall idea for this file is something similar to the main options page of GROMACS:

https://manual.gromacs.org/current/user-guide/mdp-options.html

You can scroll through it for some inspiration. For example, they have the parameter names highlighted in red. We could do the same. As they are also using Sphinx, it should be possible :-)

PicoCentauri · 2025-10-17T09:23:39Z

docs/src/getting-started/train_yaml_config.rst

+
+Computational Parameters
+======================================
+The computational parameters define the computational device, precision and seed. These parameters are optional.


Suggested change

-The computational parameters define the computational device, precision and seed. These parameters are optional.
+The computational parameters define the computational ``device``, ``precision`` and ``seed``. These parameters are optional.

PicoCentauri · 2025-10-17T09:24:23Z

docs/src/getting-started/train_yaml_config.rst

+    precision: 32
+    seed: 0
+
+:param device [optional]: The computational device used for model training. The script automatically


Suggested change

-:param device [optional]: The computational device used for model training. The script automatically
+:param device [optional]: The computational device used for model training. ``metatrain`` automatically

PicoCentauri · 2025-10-17T09:25:21Z

docs/src/getting-started/train_yaml_config.rst

+    ``float16`` respectively. The datatypes that can be supported also depends on the model architecture used.
+:param seed [optional]: The seed used for non-deterministic operations and is used to set the seed for ``numpy.random``,
+    ``random``, ``torch`` and ``torch.cuda``. The input must be a non-negative integer. This parameter is important for ensuring
+    reproducibility. If not specified, the seed is generated randomly.


Suggested change

-    reproducibility. If not specified, the seed is generated randomly.
+    reproducibility. If not specified, the seed is generated randomly and reported in the log.
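
As a rough illustration of the seed behaviour described in this suggestion, the following Python sketch resolves an optional seed, draws one at random when none is given, and reports it in the log. The function name `resolve_seed`, the logger name, and the upper bound of the random draw are assumptions made for this example and are not taken from the metatrain code base. Only the standard-library `random` generator is actually seeded here; the `numpy` and `torch` calls are left as comments.

```python
import logging
import random

# Hypothetical logger name, chosen for this sketch only.
logger = logging.getLogger("seed_sketch")


def resolve_seed(seed=None):
    """Return a non-negative integer seed and seed the ``random`` module.

    If ``seed`` is None, a seed is drawn at random and reported in the log
    so that the run can be reproduced later.
    """
    if seed is None:
        # Assumed range for the random draw; the real range may differ.
        seed = random.randint(0, 2**16 - 1)
        logger.info("No seed specified, using randomly generated seed %d", seed)

    # Booleans are instances of int in Python, so reject them explicitly.
    if isinstance(seed, bool) or not isinstance(seed, int) or seed < 0:
        raise ValueError("seed must be a non-negative integer")

    random.seed(seed)
    # With the optional dependencies installed, the other generators named in
    # the docs could be seeded in the same place, for example:
    #   numpy.random.seed(seed)
    #   torch.manual_seed(seed)
    #   torch.cuda.manual_seed_all(seed)
    return seed
```

Calling `resolve_seed(0)` mirrors `seed: 0` in the YAML input, while `resolve_seed()` corresponds to leaving the parameter out, in which case the chosen value appears in the log.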

PicoCentauri · 2025-10-17T09:26:24Z

docs/src/getting-started/train_yaml_config.rst

+The next set of parameters are also optional and deals with integration with Weights and Biases (wandb) logging. Leaving this
+section blank will simply disable wandb integration. The parameters for this section is the same as that in
+`wandb.init <https://docs.wandb.ai/ref/python/init/>`_. Here we provide a minimal example for the YAML input


I think this can be shortened a bit, maybe along the lines of:

Suggested change

-The next set of parameters are also optional and deals with integration with Weights and Biases (wandb) logging. Leaving this
-section blank will simply disable wandb integration. The parameters for this section is the same as that in
-`wandb.init <https://docs.wandb.ai/ref/python/init/>`_. Here we provide a minimal example for the YAML input
+Optional section dealing with integration with `Weights and Biases (wandb) <link>`_ logging. Leaving this
+section blank will simply disable wandb integration. The parameters for this section are the same as those in
+`wandb.init <https://docs.wandb.ai/ref/python/init/>`_. Here we provide a minimal example for the YAML input

PicoCentauri · 2025-10-17T09:26:57Z

docs/src/getting-started/train_yaml_config.rst

+        - tag2
+        notes: This is a test run
+
+All parameters of your options file will be automatically added to the wandb run so


Suggested change

-All parameters of your options file will be automatically added to the wandb run so
+All parameters of your ``options.yaml`` file will be automatically added to the wandb run so

PicoCentauri · 2025-10-17T09:29:33Z

docs/src/getting-started/train_yaml_config.rst

+Loss
+===================


Just to please my eyes

Suggested change

-Loss
-===================
+Loss
+====

HowWeiBin added 5 commits October 7, 2025 17:21

d416eef First half
3f771a8 First half
3cd7d65 1st Round of Doc
9bcd1fa Restore original tox commands
e8d22be Merge branch 'main' into yaml-reference

HowWeiBin requested a review from PicoCentauri October 16, 2025 13:43

PicoCentauri reviewed Oct 20, 2025


2 participants
