Centre de Calcul Recherche et Technologie with NVIDIA environnement TGCC-CCRT/nvidia
#
Description#
This plugin deals with specific environment characteristics of the cluster at the Très Grand Centre de calcul (France), more specifically the Nvidia GPUs partitions of the Centre de Calcul Recherche et Technologie.
YAML arguments#
The following arguments are used to configure the plugin. pyCIF will return an exception at the initialization if mandatory arguments are not specified, or if any argument does not fit accepted values or type:
Optional arguments#
- python : str, optional, default “python -m mpi4py -rc initialize=False”
the python command used to run sub-instances of pyCIF
- python_venv : str, optional
path to the python virtual environment to use
- python_module : str, optional, default “python3/3.10.6”
the python module to load
- gpu : bool, optional, default False
change the command used to run parallel programs in order to allocate one GPU
- partition : “rome” or “skylake” or “a64fx” or “v100” or “v100l” or “v100l-os” or “hybrid” or “xlarge” or “v100xl”, optional
partition on which to submit job, used as the
-q
option for thecc_msub
command- project : str, optional
project on which to submit job, used as the
-A
option for thecc_msub
command- filesystem : “Any subset of [‘scratch’, ‘work’, ‘store’], separated by commas or ‘all’”, optional, default “all”
the file system(s) required by the job, used as the
-m
option for thecc_msub
command- qos : “long” or “normal” or “test”, optional, default “normal”
Quality of Service (QoS) used to submit job, used as the
-Q
option for thecc_msub
command- walltime : int, optional, default 7200
maximum walltime of the submited job, used as the
-T
option for thecc_msub
command- nodes : int, optional, default 1
number of nodes to use by the jobs launched by the CIF, used as the
-n
option for thecc_msub
command- cores : int, optional, default 1
number of cores to use by the jobs launched by the CIF, used as the
-c
option for thecc_msub
command- submit_msub : bool, optional, default True
Submit the job with
ccc_msub
. If false simply run it within the same instance
Requirements#
The current plugin requires the present plugins to run properly:
Requirement name |
Requirement type |
Explicit definition |
Any valid |
Default name |
Default version |
---|---|---|---|---|---|
model |
True |
True |
None |
None |
YAML template#
Please find below a template for a YAML configuration:
1platform:
2 plugin:
3 name: TGCC-CCRT
4 version: nvidia
5 type: platform
6
7 # Optional arguments
8 python: XXXXX # str
9 python_venv: XXXXX # str
10 python_module: XXXXX # str
11 gpu: XXXXX # bool
12 partition: XXXXX # rome|skylake|a64fx|v100|v100l|v100l-os|hybrid|xlarge|v100xl
13 project: XXXXX # str
14 filesystem: XXXXX # Any subset of ['scratch', 'work', 'store'], separated by commas or 'all'
15 qos: XXXXX # long|normal|test
16 walltime: XXXXX # int
17 nodes: XXXXX # int
18 cores: XXXXX # int
19 submit_msub: XXXXX # bool