Model Fine-tuning

From CMU -- Language Technologies Institute -- HPC Wiki
Revision as of 01:41, 4 August 2023 by Yifengw2 (talk | contribs) (adding fine tune resources and time estimation suggestions)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Fine-tune Resources

[edit | edit source]

Fine-tune time estimation (Based on previous Hackathon feedback)

[edit | edit source]
  • Fine-tuning ESM-2 model with 35M parameter takes ~4.5 hours
  • Fine-tuning ESM-2 model with 8M parameter takes ~2.5 hours