-
Notifications
You must be signed in to change notification settings - Fork 11.2k
Replace LLM article with concise introduction #2070
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Closed
Closed
Changes from 1 commit
Commits
Show all changes
3 commits
Select commit
Hold shift + click to select a range
File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file was deleted.
Oops, something went wrong.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,36 @@ | ||
# LLMs 101: What Even Is It? | ||
--- | ||
### Preface | ||
|
||
Large language models (LLMs) are artificial neural networks trained on vast amounts of text. The most common type is the **transformer**, introduced in the 2017 paper _Attention Is All You Need_. Transformers break text into smaller units called **tokens**, turn those into numbers, run them through layers of matrix math and attention, and then predict what tokens come next. | ||
|
||
OpenAI’s ChatGPT, built on GPT-3, brought this architecture into the spotlight and kicked off today’s wave of AI applications. | ||
|
||
--- | ||
### What can LLMs do? | ||
|
||
Although they’re called _language_ models, LLMs can handle many kinds of data if trained for it. For text, typical uses include: | ||
|
||
- Generating text | ||
- Completing text | ||
- Translating text | ||
- Analyzing or summarizing text | ||
- Answering questions | ||
- Powering agents that perform tasks | ||
|
||
This Cookbook focuses on the first few, since they’re the foundation of most real-world applications. | ||
|
||
--- | ||
### How do they work? | ||
|
||
At their core, LLMs are **prediction engines**. Given some input text, they predict the most likely next token, then the next, and so on until they produce a coherent output. | ||
|
||
Tokens live in an **embedding space**—a mathematical map where related concepts sit near one another. For example, _king_ and _queen_ are neighbors, just as _emperor_ and _empress_ are. This structure allows the model to capture relationships and context when generating text. | ||
|
||
--- | ||
### **Why it matters for you** | ||
|
||
You don’t need every detail of the math to use LLMs effectively. The key takeaway is simple: | ||
**LLMs generate outputs by predicting one token at a time.** | ||
|
||
The rest of this Cookbook will show you how to harness that prediction power—whether you want to generate text, complete thoughts, or analyze documents. |
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The title contains grammatically incorrect phrasing. 'What Even Is It?' should be 'What Are They?' or 'What Is an LLM?' for proper grammar and clarity.
Copilot uses AI. Check for mistakes.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Stylistic to make it more friendly.