These notebooks show how to fine-tune an NLP model on AzureML. They are intended to be cloned to, and executed on, an AzureML compute instance within a Jupyter environment. They walk through creating a DeepSpeed-enabled training environment, creating a compute target (if one does not already exist), preparing and registering datasets, fine-tuning a model on those datasets, and registering the resulting model. Configuration is supported by only a few outside files in the `src` directory.
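As context for the DeepSpeed-enabled environment mentioned above, a minimal DeepSpeed configuration file can be generated like this sketch. The specific values (batch sizes, ZeRO stage, fp16) are illustrative assumptions, not the settings these notebooks actually use:

```python
import json

# Hypothetical minimal DeepSpeed config; all values below are assumptions
# chosen for illustration, not taken from this repo's notebooks.
ds_config = {
    "train_micro_batch_size_per_gpu": 8,   # per-GPU batch size
    "gradient_accumulation_steps": 4,      # effective batch = 8 * 4 * n_gpus
    "fp16": {"enabled": True},             # mixed-precision training
    "zero_optimization": {"stage": 1},     # ZeRO stage 1: partition optimizer state
}

# DeepSpeed is typically pointed at a JSON file, e.g. via --deepspeed ds_config.json
with open("ds_config.json", "w") as f:
    json.dump(ds_config, f, indent=2)
```

The training script can then be launched with `deepspeed train.py --deepspeed ds_config.json` (script name assumed).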
- Clone this repo into an interactive session on a fresh AzureML compute instance
- From the command line, install `requirements.txt` into the local `azureml_py38` conda environment via `conda activate azureml_py38 && pip install -r requirements.txt`
- Follow the notebooks in numerical order:
  - `01 Create compute` ensures requirements are installed and the compute cluster is accessible
  - `02 Prepare environment` creates an AzureML environment that supports DeepSpeed training
  - `03 Prepare data` downloads, preprocesses, and registers a dataset for versioned and reproducible training
  - `04 Train model` launches a distributed fine-tuning job using the outputs of the prior notebooks
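For orientation, the kind of distributed fine-tuning job that the final notebook submits could also be expressed as an AzureML CLI v2 command-job YAML roughly like the sketch below. The environment, compute, and script names here are assumptions, not the identifiers this repo registers:

```yaml
# Hypothetical AzureML CLI v2 job spec; names and counts are illustrative.
$schema: https://azuremlschemas.azureedge.net/latest/commandJob.schema.json
command: >-
  python train.py --deepspeed ds_config.json
code: ./src
environment: azureml:deepspeed-env@latest   # assumed name of the environment from notebook 02
compute: azureml:gpu-cluster                # assumed name of the cluster from notebook 01
distribution:
  type: pytorch
  process_count_per_instance: 4             # one process per GPU (assumed 4-GPU nodes)
resources:
  instance_count: 2                         # number of nodes in the job
```

The notebooks configure the equivalent job through the SDK rather than YAML, but the shape of the job (code, environment, compute, distribution) is the same.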