Created
May 20, 2015 02:07
-
-
Save juanpaco/de1508b030925c1d907c to your computer and use it in GitHub Desktop.
Output running ./demo.py infovis
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
-------------------------------------------------------------------------------- | |
Build a topic model (mallet) using a demo dataset (infovis) | |
database = data/demo/infovis/corpus | |
corpus = data/demo/infovis/corpus | |
model = data/demo/infovis/model-mallet | |
app = infovis_mallet | |
-------------------------------------------------------------------------------- | |
Available: data/demo/infovis/corpus | |
Available: tools/mallet-2.0.7 | |
Available: tools/mallet-2.0.7 | |
Available: tools/corenlp-3.3.1 | |
-------------------------------------------------------------------------------- | |
Training an LDA topic model using MALLET... | |
corpus = data/demo/infovis/corpus/corpus.txt | |
model = data/demo/infovis/model-mallet | |
token_regex = \w{3,} | |
topics = 20 | |
iters = 1000 | |
-------------------------------------------------------------------------------- | |
Already exists: data/demo/infovis/model-mallet | |
-------------------------------------------------------------------------------- | |
Import a MALLET LDA topic model as a web2py application... | |
app_name = infovis_mallet | |
app_path = apps/infovis_mallet | |
model_path = data/demo/infovis/model-mallet | |
corpus_filename = data/demo/infovis/corpus/corpus.txt | |
database_filename = data/demo/infovis/corpus/corpus.db | |
-------------------------------------------------------------------------------- | |
Creating app: infovis_mallet [apps/temp_20150519_190201_826701_6147] | |
Creating folder: [apps/temp_20150519_190201_826701_6147/data] | |
Creating folder: [apps/temp_20150519_190201_826701_6147/databases] | |
Linking folder: [apps/temp_20150519_190201_826701_6147/models] | |
Linking folder: [apps/temp_20150519_190201_826701_6147/views] | |
Linking folder: [apps/temp_20150519_190201_826701_6147/controllers] | |
Linking folder: [apps/temp_20150519_190201_826701_6147/static] | |
Linking folder: [apps/temp_20150519_190201_826701_6147/modules] | |
Creating file: [apps/temp_20150519_190201_826701_6147/__init__.py] | |
Copying [data/demo/infovis/corpus/corpus.db] --> [apps/temp_20150519_190201_826701_6147/databases/corpus.db] | |
Copying [data/demo/infovis/corpus/corpus.txt] --> [apps/temp_20150519_190201_826701_6147/data/corpus.txt] | |
Extracting [data/demo/infovis/corpus/corpus.txt] --> [apps/temp_20150519_190201_826701_6147/data/sentences.txt] | |
Exception in thread "main" java.lang.UnsupportedClassVersionError: corenlp_helpers/SentenceSplitter : Unsupported major.minor version 52.0 | |
at java.lang.ClassLoader.defineClass1(Native Method) | |
at java.lang.ClassLoader.defineClass(ClassLoader.java:800) | |
at java.security.SecureClassLoader.defineClass(SecureClassLoader.java:142) | |
at java.net.URLClassLoader.defineClass(URLClassLoader.java:449) | |
at java.net.URLClassLoader.access$100(URLClassLoader.java:71) | |
at java.net.URLClassLoader$1.run(URLClassLoader.java:361) | |
at java.net.URLClassLoader$1.run(URLClassLoader.java:355) | |
at java.security.AccessController.doPrivileged(Native Method) | |
at java.net.URLClassLoader.findClass(URLClassLoader.java:354) | |
at java.lang.ClassLoader.loadClass(ClassLoader.java:425) | |
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308) | |
at java.lang.ClassLoader.loadClass(ClassLoader.java:358) | |
at sun.launcher.LauncherHelper.checkAndLoadMain(LauncherHelper.java:482) | |
Copying [data/demo/infovis/model-mallet] --> [apps/temp_20150519_190201_826701_6147/data/mallet-lda] | |
Computing bag-of-words statistics | |
token_regex = \w{3,} | |
min_freq = 5 | |
min_doc_freq = 3 | |
max_freq_count = 4000 | |
max_co_freq_count = 100000 | |
Computing document-level statistics... | |
Loading corpus: apps/temp_20150519_190201_826701_6147/data/corpus.txt | |
Computing term freqs (449 docs)... | |
Computing term co-occurrences (449 docs)... | |
Saving term_texts (1526 terms)... | |
inserting 1526 rows... | |
Saving term_freqs (1526 terms)... | |
inserting 1526 rows... | |
Saving term_probs (1526 terms)... | |
inserting 1526 rows... | |
Saving term_doc_freqs (1526 terms)... | |
inserting 1526 rows... | |
Saving term_co_freqs (100000 term pairs)... | |
inserting 100000 rows... | |
Saving term_co_probs (100000 term pairs)... | |
inserting 100000 rows... | |
Saving term_g2 (100000 term pairs)... | |
inserting 100000 rows... | |
Computing sentence-level term statistics... | |
Loading corpus: apps/temp_20150519_190201_826701_6147/data/sentences.txt | |
An error occured while creating app: infovis_mallet [apps/infovis_mallet] | |
Traceback (most recent call last): | |
File "bin/read_mallet.py", line 85, in <module> | |
main() | |
File "bin/read_mallet.py", line 82, in main | |
ImportMalletLDA( args.app_name, args.model_path, args.corpus_path, args.database_path, args.quiet, args.overwrite ) | |
File "bin/read_mallet.py", line 62, in ImportMalletLDA | |
bow_computer.Execute() | |
File "/home/ethan/source/school/termite-data-server/bin/db/BOW_ComputeStats.py", line 44, in Execute | |
self.ComputeAndSaveSentenceLevelStatistics() | |
File "/home/ethan/source/school/termite-data-server/bin/db/BOW_ComputeStats.py", line 132, in ComputeAndSaveSentenceLevelStatistics | |
corpus = { docID : docTokens for docID, docTokens in reader } | |
File "/home/ethan/source/school/termite-data-server/bin/db/BOW_ComputeStats.py", line 132, in <dictcomp> | |
corpus = { docID : docTokens for docID, docTokens in reader } | |
File "/home/ethan/source/school/termite-data-server/bin/db/BOW_ComputeStats.py", line 55, in ReadCorpus | |
with open( filename, 'r' ) as f: | |
IOError: [Errno 2] No such file or directory: 'apps/temp_20150519_190201_826701_6147/data/sentences.txt' |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
hi,have you solved this problem?