InTDS ArchivebyCristian LeoThe Math Behind Fine-Tuning Deep Neural NetworksDive into the techniques to fine-tune Neural Networks, understand their mathematics, build them from scratch, and explore their…Apr 3, 20247Apr 3, 20247
"Deep Dive into Transformers by Hand ✍︎" by Srijanie Dey, PhD, in TDS Archive (Apr 12, 2024). Explore the details behind the power of transformers.
"Intro to LLM Agents with Langchain: When RAG is Not Enough" by Alex Honchar, in TDS Archive (Mar 15, 2024). First-order principles of brain structure for AI assistants.
"Text Embeddings: Comprehensive Guide" by Mariya Mansurova, in TDS Archive (Feb 13, 2024). Evolution, visualisation, and applications of text embeddings.
"A Complete Guide to Write your own Transformers" by Benjamin Etienne, in TDS Archive (Feb 24, 2024). An end-to-end implementation of a PyTorch Transformer, covering key concepts such as self-attention, encoders, decoders, and…
"Using Mixtral 8x7B For NLP Tasks On Small GPUs" by Nirmalya Ghosh, in AI Advances (Jan 1, 2024). Large language models (LLMs) are made up of billions of parameters, thus posing challenges when loading them onto GPU memory for model…
"(New Approach🔥) LLMs + Knowledge Graph: Handling Large documents and data for any industry —…" by Vaibhawkhemka, in Towards AI (Dec 12, 2023). Most real-world data mirrors a knowledge graph; in this messy world, one will rarely find information laid out linearly. AI system…
"How to Detect Hallucinations in LLMs" by Iulia Brezeanu, in TDS Archive (Dec 31, 2023). Teaching Chatbots to Say “I Don’t Know”.
"Entity Recognition with LLM: A Complete Evaluation" by Patrick Meyer, in Towards AI (Sep 1, 2023). LLMs are capable of performing a wide range of NLP tasks, such as named entity recognition. In this study, I tested an open-source library…
"Achieve 90% Results in Few-Shot Text Classification with Just 0.1% Data" by Knowledgator Engineering (Dec 27, 2023). Zero-shot abilities of modern LLMs are truly inspiring and make us feel that AGI is pretty close. However, they require large networks and…
"Byte-Pair Encoding For Beginners" by Mina Ghashami, in TDS Archive (Oct 10, 2023). An illustrative guide to the BPE tokenizer in plain, simple language.
"Large Language Models: RoBERTa — A Robustly Optimized BERT Approach" by Vyacheslav Efimov, in TDS Archive (Sep 24, 2023). Learn about key techniques used for BERT optimisation.
"Cosine Similarity for 1 Trillion Pairs of Vectors" by Rodrigo Agundez, in Towards AI (Apr 4, 2023). Introducing ChunkDot.
"Efficient Similarity Search & Clustering of Dense Vectors in Neo4j" by Fanghua (Joshua) Yu (Feb 26, 2023). Scalable Similarity Search of GPT-3 Text Embeddings.
"Document AI | Document Understanding model at line level with LiLT, Tesseract and DocLayNet dataset" by Pierre Guillou (Feb 10, 2023). A post about training a Document Understanding model on DocLayNet, with its fine-tuning and inference code in two notebooks.
"How to Get Around OpenAI GPT-3 Token Limits" by Sung Kim, in Dev Genius (Feb 6, 2023). A Python developer’s guide to the OpenAI GPT-3 API.
"Trends in AI — 2023 Round-up" by Sergi Castella i Sapé, in Towards AI (Jan 25, 2023). What’s next for Language Models, Reinforcement Learning, Computer Vision, and leading AI companies like OpenAI and Google?
"Fine-tuning OpenAI GPT-3 to build Custom Chatbot" by Olasimbo Arigbabu (Jan 25, 2023).
"Context is Everything: Why Maximum Sequence Length Matters for AI" by Cerebras Systems (Aug 17, 2022). GPU-Impossible™ sequence lengths on Cerebras systems may enable breakthroughs in Natural Language Understanding, drug discovery and…
"The Concept of Transformers and Training A Transformers Model" by Ayoola Olafenwa, in TDS Archive (Oct 28, 2022). A step-by-step guide on how transformer networks work.