Header-only framework for data analysis on massively parallel platforms.
Updated Dec 6, 2023 - C++
This project uses a multilingual embedding model to align sentences in one language (preferably a low-resource language) with their likely translations in English. The idea is that if we can crawl documents in both languages online (e.g., from news sites), we can pair up sentences that are translations of each other by comparing their embeddings.
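As a rough illustration of the pairing idea, the sketch below matches each source sentence to its nearest English sentence by cosine similarity and keeps the match only if it clears a threshold. It is not the project's code: the function names, the `threshold` value, and the three-dimensional toy vectors are assumptions made for the example. In practice the vectors would come from whichever multilingual embedding model the project uses.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def align_sentences(src_vecs, en_vecs, threshold=0.8):
    """Greedy nearest-neighbour alignment (hypothetical helper).

    For each source embedding, pick the most similar English embedding
    and keep the pair only if the similarity reaches `threshold`.
    Returns a list of (source_index, english_index, score) tuples.
    """
    pairs = []
    for i, u in enumerate(src_vecs):
        best_j, best_score = -1, -1.0
        for j, v in enumerate(en_vecs):
            score = cosine(u, v)
            if score > best_score:
                best_j, best_score = j, score
        if best_j >= 0 and best_score >= threshold:
            pairs.append((i, best_j, best_score))
    return pairs


# Toy embeddings standing in for real model output (assumed values).
src_vecs = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
en_vecs = [[0.0, 0.9, 0.1], [0.9, 0.1, 0.0], [0.5, 0.5, 0.0]]

pairs = align_sentences(src_vecs, en_vecs)
for src_idx, en_idx, score in pairs:
    print(f"source {src_idx} <-> english {en_idx} (cosine {score:.3f})")
```

The third source sentence has no close English counterpart in the toy data, so it is left unpaired. A greedy match like this can assign the same English sentence to several source sentences; a real pipeline would likely need a one-to-one constraint or a margin-based score.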