A2Z-F24
Embeddings
Word Vectors and the Universal Sentence Encoder
π
Understanding word vectors
π
What is word2vec
, π
Color Vectors
π
2013 paper Efficient Estimation of Word Representations in Vector Space
π’
GloVe: Global Vectors for Word Representation
π
2018 Universal Sentence Encoder paper
Embeddings
π₯
What are Word Embeddings?
from IBM
π¨
Embeddings Projector
and
Visualizing High Dimensional Space
,
Atlas
π
Embeddings tutorial
from Cohere
π
What are embeddings?
by Vicki Boykis
π
Embeddings: What they are and why they matter
π
Using open-source models for faster and cheaper text embeddings
Semantic Search and βSimilarityβ
π
What is Semantic Search?
π₯
Cosine Similarity
from StatQuest
Retrieval Augmented Generation (RAG)
π
How to use retrieval augmented generation
Code Examples with Replicate + Transformers.js
π’
All embeddings models on Replicate
,
All embeddings models for transformers.js
π»
Making your own Embeddings βDatabaseβ
, uses
all-mpnet-base-v2
on Replicate and
mixedbread-ai/mxbai-embed-large-v1
with transformers.js
π»
Retrieval Augmented Generation (RAG) with p5.js + Replicate
π»
Embeddings with Transformers.js
Clustering with UMAP dimensionality reduction
Sentence Embeddings with transformers.js and UMAP
Understanding UMAP
umap-js
UMAP p5.js examples
Simple clustering 3D colors -> 2D
Simple clustering random data, adjusting umap parameters
Animated UMAP process + Clustering sentence embeddings from a database!
Assignment
π¨
Prepare to present a final project proposal
.