🛠️ Steven Gong


Mar 22, 2025, 1 min read

Large Language Models (LLM)

Think GPT-3, or the kind of model Cohere is trying to build.

Cohere: “We train large language models”.

How many epochs?

  • Can be as few as 1, i.e. a single pass over the training data
  • https://www.reddit.com/r/LocalLLaMA/comments/1ae0uig/how_many_epochs_do_you_train_an_llm_for_in_the/
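
To make "1 epoch" concrete, here is a rough sketch of how an epoch count translates into optimizer steps. The function names and the example numbers (dataset size, batch size, sequence length) are made up for illustration and do not come from any particular training run.

```python
import math


def steps_per_epoch(dataset_tokens: int, batch_size: int, seq_len: int) -> int:
    """Optimizer steps needed to see every token in the dataset once."""
    # Each step consumes batch_size sequences of seq_len tokens.
    tokens_per_step = batch_size * seq_len
    # Round up so the final partial batch still counts as a step.
    return math.ceil(dataset_tokens / tokens_per_step)


def total_steps(dataset_tokens: int, batch_size: int, seq_len: int, epochs: int = 1) -> int:
    """Total optimizer steps for a given number of passes over the data."""
    return epochs * steps_per_epoch(dataset_tokens, batch_size, seq_len)


if __name__ == "__main__":
    # Hypothetical toy numbers: 1M tokens, batch of 8 sequences, 512 tokens each.
    print(total_steps(1_000_000, batch_size=8, seq_len=512, epochs=1))
```

With epochs=1, every token contributes to exactly one gradient update, so the total step count is fixed by the dataset size and the tokens consumed per step.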

Concepts

  • Fine-Tuning

