[HN Gopher] A Visual Intro to Large Language Models
___________________________________________________________________
A Visual Intro to Large Language Models
Author : jalammar
Score : 11 points
Date : 2021-11-22 13:04 UTC (2 days ago)
(HTM) web link (docs.cohere.ai)
(TXT) w3m dump (docs.cohere.ai)
| jalammar wrote:
| Hi HN,
|
| This is the first in a series of articles I'm writing to
| introduce devs to practical applications of large NLP language
| models (for text generations like GPT and for language
| understanding like BERT).
|
| I have been connecting the dots between the capabilities of these
| models and their business application. I still believe we're in
| the beginning of grasping the amount of potential value we can
| extract from these models. Happy to get to share these as I learn
| them from my exposure to the problem space.
|
| Some of the key visual language I'm aiming to simplify is that of
| "prompts" and their use to shape model output (leading to
| practical applications). In this post, a key visual is [1] which
| shows an example of a summarization prompt and [2] showing a
| high-level process of "prompt engineering".
|
| Would appreciate your feedback!
|
| [1] https://docs.cohere.ai/img/intro-llms/language-model-
| prompt.... [2] https://docs.cohere.ai/img/intro-llms/prompt-
| engineering-and...
___________________________________________________________________
(page generated 2021-11-24 23:01 UTC)