The Semantic Cache is a vector database wrapped with bitcoin payments. Users can query against the cache and if there's a sufficiently high similarity score for the semantic search, the cached result is returned. If no sufficiently similar match is found, it returns an invoice for the user to pay to run the query against an LLM.