heidloff.net - Building is my Passion
Niklas Heidloff

Model Distillation for Large Language Models

Model Distillation is a very interesting concept for building small models that are almost as efficient as larger models for specific tasks. This post describes the concept in general and how it can b...

Language Support for Large Language Models

Many of the leading Large Language Models currently support only a limited number of languages, especially open-source models and models built by researchers. This post describes some options for how to get these ...

Fine-tuning Models for Question Answering

Question Answering is one of the most interesting scenarios for Generative AI. While base models have often been trained with massive amounts of data, they have not always been fine-tuned for speci...

Hugging Face Transformers APIs

Hugging Face provides the Transformers library to load pretrained models and to fine-tune different types of transformer-based models in a unique and easy way. This post gives a brief summary about its ...

Watsonx.ai Trial on the IBM Cloud

Watsonx.ai is IBM’s next-generation enterprise studio for AI builders to train, validate, tune and deploy AI models, including foundation models. This post briefly describes the available trial vers...

Python and PyTorch for AI Engineers

IT professionals who want to become AI engineers need to learn new technologies. This post summarizes the languages and frameworks that I’ve started to look into recently. As a developer I’ve lear...

Understanding In-Context Learning for LLMs

There are different ways to train and tune LLMs. This post summarizes some interesting findings from a research paper on whether prompts can change the behavior of models. There are several opt...

Understanding Chain of Thought Prompting

This post describes the ‘Chain of Thought’ pattern for large language models. The technique is simple but powerful, helping AI models to reason. Some of my previo...

Evaluating Question Answering Solutions

Retrieval Augmented Generation (RAG) solutions like Question Answering are not easy to evaluate. This post summarizes some options with their pros and cons. Question Answering solutions are more t...

Retrieval Augmented Generation with Chroma and LangChain

ChatGPT is more than just the GPT model. Similarly, the AI task Question Answering is also more than invoking just one model. This post describes a simple flow that leverages vector search via Chro...

Disclaimer
The postings on this site are my own and don’t necessarily represent IBM’s positions, strategies or opinions.