docs: add all copywrite changes from (#920) #1125

Closed
akoumpa wants to merge 56 commits into akoumparouli/docs_update_perf_copywriting from main

akoumpa commented Jan 27, 2026

What does this PR do?

Add a one-line overview of what this PR aims to accomplish.

Changelog

  • Add specific line-by-line info of high-level changes in this PR.

Before your PR is "Ready for review"

Pre checks:

  • Make sure you have read and followed the Contributor guidelines.
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?

If you haven't finished some of the above items, you can still open a "Draft" PR.

Additional Information

  • Related to # (issue)

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

copy-pr-bot Bot commented Jan 27, 2026

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

jgerh left a comment

Reviewed files again. Only two minor copyedits.

Suggested change
- Imports are allowed from common safe prefixes (e.g., `nemo_automodel`, `torch`, `transformers`, …).

Suggested change
- `str(cfg)` / `repr(cfg)` prints placeholders (e.g., `${DATABRICKS_TOKEN}`), not resolved secrets.
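
The behavior in this suggestion can be illustrated with a minimal sketch. It assumes a hypothetical `SecretStr` wrapper and a `${VAR}` placeholder syntax resolved from environment variables; neither the class name nor the `resolve` method is taken from the repository, and the actual config implementation may differ.

```python
import os
import re

# Matches ${VAR_NAME} placeholders (assumed syntax for this sketch).
_PLACEHOLDER = re.compile(r"\$\{(\w+)\}")


class SecretStr:
    """Hypothetical wrapper: stores the placeholder, resolves it only on request."""

    def __init__(self, raw: str):
        self._raw = raw  # e.g. "${DATABRICKS_TOKEN}"

    def resolve(self) -> str:
        # Substitute each ${VAR} with its environment value at access time.
        return _PLACEHOLDER.sub(lambda m: os.environ[m.group(1)], self._raw)

    def __repr__(self) -> str:
        # Printing or logging shows the placeholder, never the resolved secret.
        return self._raw

    __str__ = __repr__
```

With this shape, logging a config that contains such a value (for example `print({"token": SecretStr("${DATABRICKS_TOKEN}")})`) shows only the placeholder, while code that needs the secret calls `resolve()` explicitly.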

akoumpa changed the title from "docs: add all copywrite changes from #920 (#1123)" to "docs: add all copywrite changes from (#920)" on Jan 27, 2026
ZhiyuLi-Nvidia and others added 29 commits February 5, 2026 06:36
* fix pp batch issue

Signed-off-by: Zhiyu Li <zhiyul@NVIDIA.com>

* lint

Signed-off-by: Zhiyu Li <zhiyul@NVIDIA.com>

---------

Signed-off-by: Zhiyu Li <zhiyul@NVIDIA.com>
* stream writes

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* add test

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

---------

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>
* explain patch_inner_model and patch_causal_lm_model more

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update pipelining.md

* Update pipelining.md

* Update pipelining.md

* Update docs/guides/pipelining.md

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

* Update pipelining.md

---------

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>
* add duration time for test logging (top-10)

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* Update run_test.sh

---------

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
* Address CVEs

Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com>

* Update uv lock

Signed-off-by: thomasdhc <thomasdhc@users.noreply.github.com>

* Update setuptools

Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com>

* Update uv lock

Signed-off-by: thomasdhc <thomasdhc@users.noreply.github.com>

---------

Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com>
Signed-off-by: thomasdhc <thomasdhc@users.noreply.github.com>
Co-authored-by: thomasdhc <thomasdhc@users.noreply.github.com>
* add qwen3 235b recipe

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* add tests

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* fmt

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* rm duplicate freeze config

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* fix args

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* update vlm test to unfreeze emb

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* fix

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* update tests

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

---------

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
Co-authored-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
* chore: refactor common te module

Signed-off-by: Zhiyu Li <zhiyul@NVIDIA.com>

* chore: refactor common te module

Signed-off-by: Zhiyu Li <zhiyul@NVIDIA.com>

* fix test

Signed-off-by: Zhiyu Li <zhiyul@NVIDIA.com>

---------

Signed-off-by: Zhiyu Li <zhiyul@NVIDIA.com>
* add ortho optimizers

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* fmt

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* format

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* add shard placement fn and dion tp

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* Add dion config

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* add tests

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* update dep

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* Update uv lock

Signed-off-by: HuiyingLi <HuiyingLi@users.noreply.github.com>

* fix tests

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* fmt

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* copyright

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* sync optimizer state

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* update recipe

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* copyright

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* update dep

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* Update uv lock

Signed-off-by: HuiyingLi <HuiyingLi@users.noreply.github.com>

* update dep

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* Update uv lock

Signed-off-by: HuiyingLi <HuiyingLi@users.noreply.github.com>

* revert dion tp

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* fix

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* fix

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* Update uv lock

Signed-off-by: HuiyingLi <HuiyingLi@users.noreply.github.com>

* fix

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* fix test

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

---------

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>
Signed-off-by: HuiyingLi <HuiyingLi@users.noreply.github.com>
Co-authored-by: HuiyingLi <HuiyingLi@users.noreply.github.com>
* update vlm/llm coverage tables

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* recapitalize

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* remove

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* update hints

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* Update README.md

---------

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Fix release docs syntax error

Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com>
update new models

Signed-off-by: Huiying Li <willwin.lee@gmail.com>
* add custom loss

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* format

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

* fmt

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

---------

Signed-off-by: HuiyingLi <willwin.lee@gmail.com>
* Implements DoRA

Signed-off-by: Naveenraj Kamalakannan <therealnaveenkamal@gmail.com>

* added test_hf_peft_dora_checkpoint

Signed-off-by: Naveenraj Kamalakannan <therealnaveenkamal@gmail.com>

* added cli args to conftest

Signed-off-by: Naveenraj Kamalakannan <therealnaveenkamal@gmail.com>

* name fix

Signed-off-by: Naveenraj Kamalakannan <therealnaveenkamal@gmail.com>

---------

Signed-off-by: Naveenraj Kamalakannan <therealnaveenkamal@gmail.com>
Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
* move perf to the top

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* add last update footer

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

---------

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
* remove _original_strings

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

---------

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
* fix: add force_hf for vanilla hf llama in hf recipe

Signed-off-by: Zhiyu Li <zhiyul@NVIDIA.com>

* fix llama3 8b

Signed-off-by: Zhiyu Li <zhiyul@NVIDIA.com>

---------

Signed-off-by: Zhiyu Li <zhiyul@NVIDIA.com>
* feat: Add Minimax M2 model implementation

Signed-off-by: Hemil Desai <hemild@nvidia.com>

* fix

Signed-off-by: Hemil Desai <hemild@nvidia.com>

---------

Signed-off-by: Hemil Desai <hemild@nvidia.com>
* feat: Add GroupedExpertsTE backend

Signed-off-by: Hemil Desai <hemild@nvidia.com>

* fix

Signed-off-by: Hemil Desai <hemild@nvidia.com>

* fix

Signed-off-by: Hemil Desai <hemild@nvidia.com>

* fix

Signed-off-by: Hemil Desai <hemild@nvidia.com>

---------

Signed-off-by: Hemil Desai <hemild@nvidia.com>
* lower-bound megatronfsdp version

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* Update uv lock

Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>

---------

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
* add ministral3 parallel plan

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fix qwen3 sp

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* pytest import fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* f

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* add tests

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* add qwen seq

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* add fused-linear-ce functional test

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* lint

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* lint

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fmt

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* more fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* sum over vocab

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* add alias sharding

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

---------

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
* update filenames

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* update top links

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

---------

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
* staging

Signed-off-by: adil-a <adil.asif2000@hotmail.com>

* clean

Signed-off-by: adil-a <adil.asif2000@hotmail.com>

* removing *_size from signature

Signed-off-by: adil-a <adil.asif2000@hotmail.com>

* refactoring tp plan logic

Signed-off-by: adil-a <adil.asif2000@hotmail.com>

* changes to other recipes

Signed-off-by: adil-a <adil.asif2000@hotmail.com>

* unit tests

Signed-off-by: adil-a <adil.asif2000@hotmail.com>

* fixing configs

Signed-off-by: adil-a <adil.asif2000@hotmail.com>

* lint

Signed-off-by: adil-a <adil.asif2000@hotmail.com>

* update tests

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* l2 tests

Signed-off-by: adil-a <adil.asif2000@hotmail.com>

---------

Signed-off-by: adil-a <adil.asif2000@hotmail.com>
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Co-authored-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
* update filenames

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* update top links

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* Update README.md minimax

* Update README.md

---------

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
* Fix checkpoint auto-loading and add explicit restore control

Checkpoints are no longer automatically loaded. Users must
explicitly set `checkpoint.restore_from` to resume training.

* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* simplify

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* format

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* add recipe signature check

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* update behavior: if ckpt exists but incompatible warn+ignore instead of crash

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* use cached assets

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

---------

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Co-authored-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* finally

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* break build_model_and_optimizer into two functions

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* simplify

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

---------

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
[fix]: temp workaround for VocabParallelEmbedding to address OOM

Signed-off-by: Zhiyu Li <zhiyul@NVIDIA.com>
Upgrade gnupg to address cve

Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com>

akoumpa commented Feb 10, 2026

I'm not sure what happened here; I'm closing this one.
