Models for Latin Intertextuality Search
Collection
Models useful for discovering intertextual links between historical Latin authors. • 18 items • Updated
This model is a fine-tuned version of intfloat/multilingual-e5-base for generating embeddings of Latin texts to detect intertextual relationships.
The model is part of the Loci Similes benchmark setup (Schelb et al., 2026), evaluated on expert-verified Latin intertextual links. It is designed to work with the LociSimiles Python package API: https://julianschelb.github.io/locisimiles/api/.
from sentence_transformers import SentenceTransformer
model = SentenceTransformer("julian-schelb/multilingual-e5-base-emb-lat-intertext-v1")
query_embedding = model.encode("Query: arma virumque cano")
candidate_embedding = model.encode("Candidate: arma virumque cano troiae qui primus ab oris")
If prompts are configured in the model, prefer:
prompt_name="query" for query textsprompt_name="match" for candidate texts@misc{schelb2026locisimilesbenchmarkextracting,
title={Loci Similes: A Benchmark for Extracting Intertextualities in Latin Literature},
author={Julian Schelb and Michael Wittweiler and Marie Revellio and Barbara Feichtinger and Andreas Spitz},
year={2026},
eprint={2601.07533},
archivePrefix={arXiv},
primaryClass={cs.IR},
url={https://arxiv.org/abs/2601.07533},
}
Base model
intfloat/multilingual-e5-base