"Word2vec"의 두 판 사이의 차이
둘러보기로 가기
검색하러 가기
imported>Pythagoras0 |
imported>Pythagoras0 |
||
8번째 줄: | 8번째 줄: | ||
==memo== | ==memo== | ||
* https://medium.com/@klintcho/doc2vec-tutorial-using-gensim-ab3ac03d3a1 | * https://medium.com/@klintcho/doc2vec-tutorial-using-gensim-ab3ac03d3a1 | ||
+ | |||
+ | |||
+ | ==related items== | ||
+ | * [[Singular value decomposition]] | ||
2017년 5월 2일 (화) 08:09 판
gensim
- https://rare-technologies.com/word2vec-tutorial/
- document similarity
- https://rare-technologies.com/performance-shootout-of-nearest-neighbours-contestants/
- Using gensim’s memory-friendly streaming API I then converted these plain text tokens to TF-IDF vectors, ran Singular Value Decomposition (SVD) on this TF-IDF matrix to build a latent semantic analysis (LSA) model and finally stored each Wikipedia document as a 500-dimensional LSA vector to disk.
memo