LLMs on your local Computer (Part 1)

The Cambrian explosion of Large Language Models (LLMs) is happening right now. Increasingly astonishing models are published and used for text generation tasks ranging from question answering to fact checking and knowledge inference. Models with sizes ranging from 100 million to 7 billion parameters and more are available under open-source licenses. Using these models started with proprietary APIs and has evolved to binaries that run on your computer. But which tools exactly can you use? What features do they have? And which models do they support?

Large Language Models: Library Overview for Training, Fine-Tuning, Inference and More

In essence, Large Language Models are neural networks with a transformer architecture. The evolution of LLMs is a history of scaling: input data sources and tokenization, training methods and pipelines, model architecture and number of parameters, and the hardware required for training and inference. For all of these concerns, dedicated libraries have emerged that provide the necessary support for this continued evolution.

Large Language Models: Modern Gen4 LLM Overview (LLaMA, Pythia, PaLM2 and More)

Large Language Models are sophisticated neural networks that produce texts. By creating one word at a time, given a context of other words, these models produce texts that rival those written by humans. The creation of LLMs began back in 2018 and continues to this date with ever more complex model architectures, larger amounts of training text, and higher parameter counts.

Large Language Models: Comparing Gen2/Gen3 Models (Bloom, Gopher, OPT and More)

Large Language Models are sophisticated neural networks that produce texts. Since their inception in 2018, they have evolved dramatically and now deliver texts that can rival those written by humans. To better understand this evolution, this blog series investigates individual models to uncover how they advance. Specifically, it explains insights from the published papers about each model and draws conclusions from benchmark comparisons.

Large Language Models: Comparing Gen2/Gen3 Models (GPT-3, GPT-J, MT5 and More)

Large Language Models are sophisticated neural networks that produce texts. By creating one word at a time, given a context of other words, these models produce texts that can rival a human's output. The creation of LLMs began back in 2018, building on the transformer neural network architecture. Since then, development has continued to this date with ever more complex model architectures, larger amounts of training text, and higher parameter counts.

Large Language Models: Comparing Gen 1 Models (GPT, BERT, T5 and More)

The creation of Large Language Models (LLMs) began in 2018. Three factors emerged and were combined in LLMs: powerful computer and graphics processing units, huge amounts of structured and unstructured data that could be processed fast, and first-rate open-source projects for the creation and training of neural networks.

Introduction to Large Language Models

Large Language Models (LLMs) are a ubiquitous technology that enables humans to use natural language for interacting with a computer in a broad range of tasks. LLMs can answer questions about history and real-world events, create step-by-step task plans, solve mathematical questions, and reflect on any input text to create summaries or identify text characteristics. Using recent LLMs like GPT-4 is a fascinating and surprising experience.

Raspberry Pico Badger: Hardware Features and OS

The Raspberry Pico Badger is a consumer product by the Raspberry Pi accessory maker Pimoroni. Advertised as a portable badge with a 2.9 inch e-ink display and a Pico W, it's a portable, programmable microcomputer. This article investigates this unique product from its hardware side as well as its software, aptly named Badger OS.

ESP32 Camera: Hardware and GPIO Functions

The ESP32 Camera is a compact ESP32 board with one fixed component: the camera that gives it its name. It has a very small form factor, and although some pins are used directly by the camera, you still get 10 GPIO pins for connecting other sensors.