Releases · BlackKakapo/Romanian-Word-Embeddings

10 Apr 11:09

BlackKakapo

v1.4

1197e7f

Romanian Word Embeddings – SG & FastText (with PCA) Latest

Latest

🔍 Overview
This release contains pretrained Word2Vec word embeddings for the Romanian language, trained using:

Skip-Gram (SG) and
FastText (FT) architectures
with dimensionality reduction via PCA.

These embeddings are suitable for:

Word-level similarity
Semantic analogy tasks
Input for classic ML models (e.g., classifiers, clustering)
Visualization & exploration

PCA was applied to reduce vector size from 300 ➜ 120 for better efficiency and speed.
Happy embedding!

Assets 2

31 Jan 08:00

BlackKakapo

v1.3

1364f78

FastText

v1.3

Update README.md

Assets 2

24 Jan 13:28

BlackKakapo

v1.2

07d0538

v1.2

Update README.md

Assets 2

20 Dec 14:41

BlackKakapo

v1.1

eae4b09

CBOW

v1.1

Update README.md

Assets 2

11 Dec 10:08

BlackKakapo

v1.0

5fb78a9

CBOW_300_25_5

v1.0

Update README.md

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Releases: BlackKakapo/Romanian-Word-Embeddings

Romanian Word Embeddings – SG & FastText (with PCA)

Uh oh!

FastText

Uh oh!

SG

Uh oh!

CBOW

Uh oh!

CBOW_300_25_5

Uh oh!