
torchtune dry-run feature request #2453

Open

agunapal opened this issue Mar 3, 2025 · 6 comments
Labels: community help wanted · enhancement · triage review

Comments

agunapal commented Mar 3, 2025

New feature request

I know torchtune does some validation checks to ensure the prompt is not malformed.

But we have no way of knowing if SFT is happening with the right input/output format.

What would be great is to have a torchtune dry-run command, which does the following:

  • It takes the first row of the dataset and shows the input/output string being sent to the loss function.
  • It also shows the tokenized version of the above.

This way one can visually inspect and be sure that torchtune has been configured correctly for fine-tuning.
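
A rough sketch of what such a command could print (just illustrative; the `"tokens"`/`"labels"` keys and the `-100` ignore index follow common torchtune/PyTorch conventions, not a confirmed API):

```python
# Hypothetical sketch only -- not an existing torchtune command.
# Assumes samples are dicts with "tokens" (input ids) and "labels"
# (loss targets, with -100 marking positions excluded from the loss).
def dry_run(dataset, tokenizer):
    sample = dataset[0]  # first row of the dataset
    tokens, labels = sample["tokens"], sample["labels"]

    print("=== token ids ===")
    print(tokens)
    print("=== decoded input ===")
    print(tokenizer.decode(tokens))

    # Show only the positions that actually reach the loss function
    target_ids = [t for t in labels if t != -100]
    print("=== decoded loss targets ===")
    print(tokenizer.decode(target_ids))
```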

init27 commented Mar 3, 2025

Great minds chai alike 😁 #2452

felipemello1 added the enhancement and triage review labels Mar 4, 2025

felipemello1 commented Mar 4, 2025

Hey @agunapal, that's a good request! We don't have bandwidth to look into it at the moment, but if you want to propose an RFC (a PR with high-level ideas on how to implement it), we could review it.

As a sanity check, you could clone the recipe and modify the training loop to inspect the model inputs and their tokenizer.decode output. Would that work for you?
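
Something along these lines (a sketch; the `self._dataloader`/`self._tokenizer` attributes and the `"tokens"` batch key are assumptions, so match them to whatever names your recipe actually uses):

```python
# Sketch: add a few lines near the top of the cloned recipe's training loop.
for idx, batch in enumerate(self._dataloader):
    if idx == 0:  # inspect only the first batch
        print("token ids:", batch["tokens"][0].tolist())
        print("decoded:", self._tokenizer.decode(batch["tokens"][0].tolist()))
    # ... rest of the training step stays unchanged
```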


felipemello1 commented Mar 4, 2025

Just saw your comment @init27! I will close your issue since I have already replied here, so we can consolidate the conversation. I will check whether someone from the community is interested in picking this up, since both of you are interested.


felipemello1 commented Mar 4, 2025

Just so I can understand the request better: you want to do a full epoch on the dataloader, but without training, to see if there are dataset issues. And you want to be able to possibly print or store the input/output of that dataloader, to confirm that it looks like it should. Is that it, or is there something else? Roughly like the sketch below?

@agunapal @init27
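
Just to confirm we are talking about the same thing, a sketch of such a validation-only epoch (no model, no optimizer; the `"tokens"` key is an assumption):

```python
# Sketch: iterate the whole dataloader without a forward/backward pass,
# printing the first few decoded batches for spot checks. Any malformed
# sample would surface here instead of mid-training.
def dry_run_epoch(dataloader, tokenizer, show_first_n=3):
    count = 0
    for batch in dataloader:
        if count < show_first_n:
            print(f"--- batch {count} ---")
            print(tokenizer.decode(batch["tokens"][0].tolist()))
        count += 1
    print(f"iterated {count} batches without errors")
```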


init27 commented Mar 4, 2025

Yes exactly, thanks for confirming!

The idea is that instead of the training loop crashing on us mid-run, we have a method to validate all message examples up front.

For example:

Right now I'm using synthetic conversations; sometimes these have duplicate assistant messages that go unnoticed, so the job crashes mid-training. Having a way to run these checks 'offline'/before fine-tuning would be really great! A sketch of what I mean is below.
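
Something like this, run over the raw conversations before fine-tuning, would already catch my case (assuming the usual list-of-`{"role", "content"}` chat format):

```python
# Sketch: flag conversations with two consecutive messages from the same
# role (e.g. duplicate assistant turns) before training ever starts.
def find_bad_conversations(conversations):
    bad = []
    for i, messages in enumerate(conversations):
        roles = [m["role"] for m in messages]
        if any(a == b for a, b in zip(roles, roles[1:])):
            bad.append(i)
    return bad
```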


agunapal commented Mar 4, 2025

@felipemello1 Yes. For my use case, I am using a custom prompt template, a custom dataset, and llama-guard (the tokenizer is slightly different). I want to make sure that the model is getting the correct input and that it's not a case of garbage in, garbage out. Currently I am adding prints to get around this. It would be nice to have this utility to visually inspect the final prompt and the corresponding tokens.
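
For reference, the prints I am adding today look roughly like this (`my_dataset`/`my_tokenizer` are placeholders for my custom setup; I believe CROSS_ENTROPY_IGNORE_IDX is torchtune's label-mask value, -100):

```python
# Rough version of the prints I add by hand: show each token next to a
# marker for whether it is trained on. Names with "my_" are placeholders.
from torchtune.data import CROSS_ENTROPY_IGNORE_IDX  # -100

sample = my_dataset[0]
for tok, lab in zip(sample["tokens"], sample["labels"]):
    trained = " " if lab == CROSS_ENTROPY_IGNORE_IDX else "*"
    print(f"{trained} {tok:>6} {my_tokenizer.decode([tok])!r}")
```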

felipemello1 added the community help wanted label Mar 4, 2025