Word Embeddings

This page provides Word2Vec and FastText models: purpose-built word embeddings for financial textual analysis. The embeddings were trained on the Dow Jones Newswires Text News Feed from January 1, 2000, to September 14, 2015. This corpus contains millions of news stories (2,733,035 unique tokens) covering finance, economics, politics, and other topics, from news agencies worldwide. Extensive text pre-processing was applied to remove redundant characters, sentences, and structures from the corpus. Four variations are available to download: Word2Vec and FastText, each trained with the CBOW and Skip-gram architectures.
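
To make the difference between the four variations more concrete, here is a minimal, standard-library-only sketch of how training examples differ between Skip-gram and CBOW, and of the character n-grams that FastText adds on top of whole words. It is an illustration of the general techniques, not the pipeline used to build these embeddings; the function names, the window size, and the toy headline are all assumptions made for the example.

```python
from typing import List, Tuple


def skipgram_pairs(tokens: List[str], window: int = 2) -> List[Tuple[str, str]]:
    """Skip-gram view: one (center, context) pair per neighbor within the window."""
    pairs = []
    for i, center in enumerate(tokens):
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        pairs.extend((center, tokens[j]) for j in range(lo, hi) if j != i)
    return pairs


def cbow_examples(tokens: List[str], window: int = 2) -> List[Tuple[List[str], str]]:
    """CBOW view: the whole context window is used together to predict the center word."""
    examples = []
    for i, center in enumerate(tokens):
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        context = [tokens[j] for j in range(lo, hi) if j != i]
        if context:  # skip single-token inputs, which have no context
            examples.append((context, center))
    return examples


def char_ngrams(word: str, n_min: int = 3, n_max: int = 6) -> List[str]:
    """FastText-style subwords: character n-grams of the word wrapped in '<' and '>'."""
    wrapped = f"<{word}>"
    return [
        wrapped[i:i + n]
        for n in range(n_min, n_max + 1)
        for i in range(len(wrapped) - n + 1)
    ]


if __name__ == "__main__":
    headline = ["fed", "raises", "rates"]  # hypothetical pre-processed tokens
    print(skipgram_pairs(headline, window=1))
    print(cbow_examples(headline, window=1))
    print(char_ngrams("fed", n_min=3, n_max=3))
```

Because FastText represents a word through its character n-grams, it can typically produce a vector for a word that never appeared in the training corpus, whereas a plain Word2Vec model only covers words in its vocabulary.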