@juanpaco
Created May 20, 2015 02:07
Output running ./demo.py infovis
--------------------------------------------------------------------------------
Build a topic model (mallet) using a demo dataset (infovis)
database = data/demo/infovis/corpus
corpus = data/demo/infovis/corpus
model = data/demo/infovis/model-mallet
app = infovis_mallet
--------------------------------------------------------------------------------
Available: data/demo/infovis/corpus
Available: tools/mallet-2.0.7
Available: tools/mallet-2.0.7
Available: tools/corenlp-3.3.1
--------------------------------------------------------------------------------
Training an LDA topic model using MALLET...
corpus = data/demo/infovis/corpus/corpus.txt
model = data/demo/infovis/model-mallet
token_regex = \w{3,}
topics = 20
iters = 1000
--------------------------------------------------------------------------------
Already exists: data/demo/infovis/model-mallet
--------------------------------------------------------------------------------
Import a MALLET LDA topic model as a web2py application...
app_name = infovis_mallet
app_path = apps/infovis_mallet
model_path = data/demo/infovis/model-mallet
corpus_filename = data/demo/infovis/corpus/corpus.txt
database_filename = data/demo/infovis/corpus/corpus.db
--------------------------------------------------------------------------------
Creating app: infovis_mallet [apps/temp_20150519_190201_826701_6147]
Creating folder: [apps/temp_20150519_190201_826701_6147/data]
Creating folder: [apps/temp_20150519_190201_826701_6147/databases]
Linking folder: [apps/temp_20150519_190201_826701_6147/models]
Linking folder: [apps/temp_20150519_190201_826701_6147/views]
Linking folder: [apps/temp_20150519_190201_826701_6147/controllers]
Linking folder: [apps/temp_20150519_190201_826701_6147/static]
Linking folder: [apps/temp_20150519_190201_826701_6147/modules]
Creating file: [apps/temp_20150519_190201_826701_6147/__init__.py]
Copying [data/demo/infovis/corpus/corpus.db] --> [apps/temp_20150519_190201_826701_6147/databases/corpus.db]
Copying [data/demo/infovis/corpus/corpus.txt] --> [apps/temp_20150519_190201_826701_6147/data/corpus.txt]
Extracting [data/demo/infovis/corpus/corpus.txt] --> [apps/temp_20150519_190201_826701_6147/data/sentences.txt]
Exception in thread "main" java.lang.UnsupportedClassVersionError: corenlp_helpers/SentenceSplitter : Unsupported major.minor version 52.0
at java.lang.ClassLoader.defineClass1(Native Method)
at java.lang.ClassLoader.defineClass(ClassLoader.java:800)
at java.security.SecureClassLoader.defineClass(SecureClassLoader.java:142)
at java.net.URLClassLoader.defineClass(URLClassLoader.java:449)
at java.net.URLClassLoader.access$100(URLClassLoader.java:71)
at java.net.URLClassLoader$1.run(URLClassLoader.java:361)
at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
at sun.launcher.LauncherHelper.checkAndLoadMain(LauncherHelper.java:482)
Copying [data/demo/infovis/model-mallet] --> [apps/temp_20150519_190201_826701_6147/data/mallet-lda]
Computing bag-of-words statistics
token_regex = \w{3,}
min_freq = 5
min_doc_freq = 3
max_freq_count = 4000
max_co_freq_count = 100000
Computing document-level statistics...
Loading corpus: apps/temp_20150519_190201_826701_6147/data/corpus.txt
Computing term freqs (449 docs)...
Computing term co-occurrences (449 docs)...
Saving term_texts (1526 terms)...
inserting 1526 rows...
Saving term_freqs (1526 terms)...
inserting 1526 rows...
Saving term_probs (1526 terms)...
inserting 1526 rows...
Saving term_doc_freqs (1526 terms)...
inserting 1526 rows...
Saving term_co_freqs (100000 term pairs)...
inserting 100000 rows...
Saving term_co_probs (100000 term pairs)...
inserting 100000 rows...
Saving term_g2 (100000 term pairs)...
inserting 100000 rows...
Computing sentence-level term statistics...
Loading corpus: apps/temp_20150519_190201_826701_6147/data/sentences.txt
An error occured while creating app: infovis_mallet [apps/infovis_mallet]
Traceback (most recent call last):
File "bin/read_mallet.py", line 85, in <module>
main()
File "bin/read_mallet.py", line 82, in main
ImportMalletLDA( args.app_name, args.model_path, args.corpus_path, args.database_path, args.quiet, args.overwrite )
File "bin/read_mallet.py", line 62, in ImportMalletLDA
bow_computer.Execute()
File "/home/ethan/source/school/termite-data-server/bin/db/BOW_ComputeStats.py", line 44, in Execute
self.ComputeAndSaveSentenceLevelStatistics()
File "/home/ethan/source/school/termite-data-server/bin/db/BOW_ComputeStats.py", line 132, in ComputeAndSaveSentenceLevelStatistics
corpus = { docID : docTokens for docID, docTokens in reader }
File "/home/ethan/source/school/termite-data-server/bin/db/BOW_ComputeStats.py", line 132, in <dictcomp>
corpus = { docID : docTokens for docID, docTokens in reader }
File "/home/ethan/source/school/termite-data-server/bin/db/BOW_ComputeStats.py", line 55, in ReadCorpus
with open( filename, 'r' ) as f:
IOError: [Errno 2] No such file or directory: 'apps/temp_20150519_190201_826701_6147/data/sentences.txt'
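
A note on the two errors above: `Unsupported major.minor version 52.0` means `corenlp_helpers/SentenceSplitter` was compiled as a Java 8 class file and the JVM that ran it is older than Java 8. Because the splitter never ran, `sentences.txt` was presumably never written, which would explain the later `IOError`. The sketch below is not part of termite-data-server; it is a minimal illustration of how the class-file version can be read from the first 8 bytes of any `.class` file. The function names `class_file_version` and `required_java` are hypothetical.

```python
import struct

# Every Java class file starts with this 4-byte magic number.
CLASS_MAGIC = 0xCAFEBABE


def class_file_version(header):
    """Return (major, minor) parsed from the first 8 bytes of a .class file.

    Layout (big-endian): u4 magic, u2 minor_version, u2 major_version.
    """
    if len(header) < 8:
        raise ValueError("need at least 8 bytes of class-file header")
    magic, minor, major = struct.unpack(">IHH", header[:8])
    if magic != CLASS_MAGIC:
        raise ValueError("not a Java class file")
    return major, minor


def required_java(major):
    """Map a class-file major version to the minimum Java release.

    Major 45 is Java 1.1 and each later release adds one,
    so major 51 is Java 7 and major 52 is Java 8.
    """
    release = major - 44
    return "1.%d" % release if release < 5 else str(release)


if __name__ == "__main__":
    # Synthetic header matching the version reported in the log (52.0).
    header = struct.pack(">IHH", CLASS_MAGIC, 0, 52)
    major, minor = class_file_version(header)
    print("class file %d.%d needs Java %s or newer" % (major, minor, required_java(major)))
```

To check a real file, pass `open(path, "rb").read(8)` to `class_file_version`. If the result is 52, either run the demo with a Java 8 (or newer) runtime or recompile the helper classes for the installed JVM.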
@liuhoward

Hi, have you solved this problem?
