Actueel…
/net/aps “opruimen”
/net/apsword2vec/etc/Admin/README.txt herzien in overleg met team LWPrsync -av --exclude '/work/*' --exclude '/alpino/*' alpino/.* alpino/* alpino22/cd alpino22git pull --allmake distcleangit checkout 22.04make distcleanLater…
aibrief2.cls
aibrief3.clsWat nog niet werkt:
/net/homepages
/net/homepages naar
/mnt/local stuk sneller: voorbeeld van 6:41 naar 0:18 (nog steeds
te traag?)/net/aps/bin/mkdocs en run /net/homepages/kleiweg/www/triggers/ag737649925.run Ontbrekende pakketten:
git-lfsgolang-1.23 — zelf altijd laatste installeren?htopmesonneovim — te oudninja-buildripgreptree-sitter-cliw3mhttps://urd2.let.rug.nl/~kleiweg/oudeboeken/ ???
word2vec opnieuw gecompileerd. Werkt het goed? Ik krijg veel nan in
de meerdere demo’s, zoals bij deze:
[word2vec] p209327@colossus:~/tmp$ demo-phrase-accuracy.sh Starting training using file news.2012.en.shuffled-norm0 Words processed: 296900K Vocab size: 33198K Vocab size (unigrams + bigrams): 18838711 Words in train file: 296901342 418.30user 8.04system 9:29.32elapsed 74%CPU (0avgtext+0avgdata 3814780maxresident)k 3503464inputs+3386776outputs (2major+953407minor)pagefaults 0swaps Starting training using file news.2012.en.shuffled-norm0-phrase0 Words processed: 280500K Vocab size: 38761K Vocab size (unigrams + bigrams): 21728781 Words in train file: 280513979 416.87user 6.30system 8:00.99elapsed 87%CPU (0avgtext+0avgdata 4157040maxresident)k 0inputs+3386776outputs (0major+1038984minor)pagefaults 0swaps Starting training using file news.2012.en.shuffled-norm1-phrase1 Vocab size: 681320 Words in train file: 283545447 Alpha: 0.000005 Progress: 100.00% Words/thread/sec: 106.85k 39808.72user 33.96system 33:57.92elapsed 1955%CPU (0avgtext+0avgdata 1808924maxresident)k 0inputs+1085240outputs (0major+461369minor)pagefaults 0swaps newspapers: ACCURACY TOP1: 28.12 % (297 / 1056) Total accuracy: 28.12 % Semantic accuracy: 28.12 % Syntactic accuracy: -nan % ice_hockey: ACCURACY TOP1: 20.23 % (142 / 702) Total accuracy: 24.97 % Semantic accuracy: 24.97 % Syntactic accuracy: -nan % basketball: ACCURACY TOP1: 58.17 % (349 / 600) Total accuracy: 33.42 % Semantic accuracy: 33.42 % Syntactic accuracy: -nan % airlines: ACCURACY TOP1: 19.76 % (83 / 420) Total accuracy: 31.35 % Semantic accuracy: 31.35 % Syntactic accuracy: -nan % people-companies: ACCURACY TOP1: 82.29 % (79 / 96) Total accuracy: 33.05 % Semantic accuracy: 33.05 % Syntactic accuracy: -nan % Questions seen / total: 2874 3218 89.31 %
Python
<stdin>:1: DeprecationWarning: 'cgi' is deprecated and slated for removal in Python 3.13