A simple UI for debugging correlations of text and image embeddings. At Jina AI we use this tool for vibe-checking embeddings, late chunking, multimodal retrieval, etc.
Click the image below to watch a video demo on YouTube.
`JINA_API_KEY` is used for embedding and, when necessary, for reading content from a URL; reading from a local text file is of course supported. You can also bring your own embeddings and run `npm run corr ...` for visualization, in which case you don't need `JINA_API_KEY`.
You can get a Jina API Key with free 10M tokens from https://jina.ai/#apiform. Otherwise, just run `npm run corr -- v3-blog.jsonl v3-arxiv.jsonl` for a simple vibe-check with our pre-embedded data.
The correlation visualization is served on http://localhost:3000 by default. Here's what you can do:
- Switch Layout: Click buttons in top-right to toggle between single view, vertical split, or horizontal split
- Hover Cells: See full text and similarity score for any cell
- Draw Selection: Click and drag to select regions for analysis
The control panel is always visible in the top-right corner, and you can move it anywhere on screen.
- Adjust Thresholds: Use sliders to filter by cosine similarity score (0-1) and text length
- Change Colors: Switch between RdBu, Viridis, Plasma, or Inferno color schemes
Draw a selection rectangle on the heatmap to open the panel.
- Copy Text: Copy selected content to clipboard
- Close Panel: Click X to dismiss selection
Click "Vertical" or "Horizontal" layout button to open this table.
- Hover Rows: See full text in tooltip
- Scroll: Browse through the 100 correlations at the borderline of the current threshold
You can run either `npm run embed` or `npm run corr`. Note that the double dash `--` is required for passing args: it tells npm that the following arguments should be passed to the script rather than being interpreted as npm options.
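For example (the file name here is a placeholder):

```bash
# without the --, npm would swallow -p instead of passing it to the script
npm run corr -- embeddings.jsonl -p 8080
```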
`npm run embed` uses the Jina Embedding API; you can get a Jina API Key with free 10M tokens from https://jina.ai/#apiform. After that, make sure you export it:
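```bash
export JINA_API_KEY=your_jina_key_here
```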
Options for `npm run embed`:

| Option | Description | Default |
|--------|-------------|---------|
| `<input>` | Local text file path or a URL | required |
| `-c, --chunk <type>` | Chunking type (`newline`, `punctuation`, `characters`, `regex`) | `newline` |
| `-v, --value <value>` | Value for chunking (number for `characters`, regex pattern for `regex`) | - |
| `-d, --dimensions <number>` | Embedding dimensions | `1024` |
| `-l, --late-chunking` | Enable late chunking | `false` |
| `-t, --task-type <type>` | Task type (`text-matching`, `retrieval.passage`, `retrieval.query`) | - |
| `-m, --model <name>` | Model name (`jina-embeddings-v3`, `jina-clip-v2` for images) | `jina-embeddings-v3` |
| `-o, --output <path>` | Output JSONL file path | auto-generated |
Options for `npm run corr`:

| Option | Description | Default |
|--------|-------------|---------|
| `<file1>` | First JSONL file with embeddings | required |
| `[file2]` | Second JSONL file with embeddings (optional) | - |
| `-p, --port <number>` | Port for the visualization server | `3000` |
Embed a text file with default settings (newline chunking):
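```bash
# input.txt is a placeholder for your own text file
npm run embed -- input.txt
```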
Embed content from a URL:
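```bash
# any readable URL works here
npm run embed -- https://example.com/blog-post
```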
Behind the scenes, this calls Jina Reader to crawl the content behind the given URL.
Visualize self-correlations within a single file:
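```bash
npm run corr -- embeddings.jsonl
```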
Visualize correlations between two files:
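```bash
npm run corr -- file1.jsonl file2.jsonl
```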
Use punctuation-based chunking:
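```bash
npm run embed -- input.txt -c punctuation
```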
Split by character count:
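```bash
# split into chunks of 500 characters each
npm run embed -- input.txt -c characters -v 500
```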
Use custom regex pattern for chunking:
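```bash
# e.g. split on blank lines; adjust the pattern to your needs
npm run embed -- input.txt -c regex -v "\n\n"
```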
Enable late chunking:
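```bash
npm run embed -- input.txt -l
```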
Specify task type for better embeddings:
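```bash
npm run embed -- input.txt -t retrieval.passage
```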
Custom output file:
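```bash
npm run embed -- input.txt -o my-embeddings.jsonl
```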
Embed images using jina-clip-v2, where each line in all_images.txt can be either a data URI string (without the prefix) or an image URL:
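```bash
npm run embed -- all_images.txt -m jina-clip-v2
```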
Images are also displayed in the hover tooltip and the table when the input is recognized as an image. Here's an example of self-correlations of image embeddings:
Correlations of text-image embeddings are also supported.
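For example, assuming both files were embedded with `jina-clip-v2` so the text and image vectors share the same space (file names are placeholders):

```bash
npm run corr -- texts.jsonl images.jsonl
```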
Change visualization server port:
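```bash
npm run corr -- embeddings.jsonl -p 8080
```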
Compare embeddings with different dimensions:
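One possible workflow (file names are placeholders):

```bash
# embed the same file at two dimensions, then load both
npm run embed -- input.txt -d 512 -o input-512.jsonl
npm run embed -- input.txt -d 1024 -o input-1024.jsonl
npm run corr -- input-512.jsonl input-1024.jsonl
```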
To use `npm run corr`, your data must be in a JSONL file where each line is a JSON object with:
- `chunk` (string): the text segment, a local image file path, or an image URL
- `embedding` (`Array<number>`): the embedding vector
Example line:
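```json
{"chunk": "This is a text segment.", "embedding": [0.0123, -0.0456, 0.0789]}
```

The embedding above is truncated for readability; a real vector has as many numbers as the embedding dimensions (e.g. 1024).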
If you bring your own embeddings:
- Ensure your JSONL file follows the above format.
- Each line must have both `chunk` and `embedding` fields.
- Place your file anywhere you like and reference its path when running `npm run corr`.
No extra metadata is needed; embeddings must be precomputed and stored as arrays of numbers.