scoring – Byte64

PageRank on Wikipedia

Apr 17, 2026

—

by

Andrew

PageRank is a famous scoring function invented and deployed by the Google guys in early days of websearch. It assigns each webpage a score based on the scores of webpages that link to it. As you can see, it’s a recursive definition, but if you use the right formula, then it’ll converge to something meaningful.…

Hybrid Search

Apr 16, 2026

—

by

Andrew

in Search

Andrew

I’ve already written a post on using a piecewise-linear scaling to bring BM25 and my semantic score (from cosine similarity with our embeddings) into the same numerical space. After performing this scaling, I found some important results weren’t scoring as well as they should. In particular, any search query with a common term (e.g. “the”…

Score Normalization

Apr 3, 2026

—

by

Andrew

in Search

Andrew

Currently, I have two different scoring functions: BM25 and the semantic scoring function that comes from our sentence embedding. These scores take very different ranges, but need to be combined to make a final score. It’s not simply a matter of assigning different weights to these scores. We need to stretch them out to make…

BM25 & Search Index Encoding

Apr 2, 2026

—

by

Andrew

in Data Structures, Search

Andrew

Okapi BM25 is a standard ranking formula that has been used in search engines since the 1980s. For each word in a query, it uses the frequency of that word in a document, the length of the document and the number documents that contain the word to decide how significant the word is for the…

Tag: scoring

PageRank on Wikipedia

Hybrid Search

Score Normalization

BM25 & Search Index Encoding