Skip to content

Instantly share code, notes, and snippets.

View vanatteveldt's full-sized avatar

Wouter van Atteveldt vanatteveldt

  • VU University
  • Amsterdam
View GitHub Profile
wva@amcat-production: (master) ~/ocg/mw-ocg-bundler$ bin/mw-ocg-bundler -v --php-api http://wiki.amcat.nl --api http://wiki.amcat.nl Querying
[0%] Fetching wiki configuration
[0%] Fetching wiki configuration: http://wiki.amcat.nl
[4%] Fetching wiki configuration:
[8%] Fetching wiki configuration: enwiki siteinfo
[17%] Fetching parsed articles
[17%] Fetching parsed articles: collection
[23%] Fetching parsed articles: enwiki:Querying [Parsoid, latest revision]
Retrying (1) https://en.wikipedia.org/api/rest_v1/page/html/wiki%2FQuery 404
Retrying (2) https://en.wikipedia.org/api/rest_v1/page/html/wiki%2FQuery 404
wva@amcat-production: (master) ~/ocg/mw-ocg-bundler$ cat /etc/mediawiki/parsoid/settings.js
"use strict";
exports.setup = function( parsoidConfig ) {
parsoidConfig.setInterwiki( 'amcatwiki', 'http://wiki.amcat.nl/api.php' );
// Use selective serialization (default false)
parsoidConfig.useSelser = true;
$ bin/mw-ocg-bundler -v -p amcatwiki --php-api http://wiki.amcat.nl --api http://localhost:8142 Querying
[0%] Fetching wiki configuration
[0%] Fetching wiki configuration: http://wiki.amcat.nl
[4%] Fetching wiki configuration:
[8%] Fetching wiki configuration: amcatwiki siteinfo
[17%] Fetching parsed articles
[17%] Fetching parsed articles: collection
[23%] Fetching parsed articles: amcatwiki:Querying [Parsoid, latest revision]
Retrying (1) http://wiki.amcat.nl/api/rest_v1/page/html/Querying 404
Retrying (2) http://wiki.amcat.nl/api/rest_v1/page/html/Querying 404
$ bin/mw-ocg-bundler -v -p amcatwiki --php-api http://wiki.amcat.nl/api.php --api http://localhost:8142 Querying
[0%] Fetching wiki configuration
[0%] Fetching wiki configuration: http://wiki.amcat.nl/api.php
Retrying (1) http://wiki.amcat.nl/api.php/api.php?action=query&meta=filerepoinfo&format=json 404
Retrying (2) http://wiki.amcat.nl/api.php/api.php?action=query&meta=filerepoinfo&format=json 404
Retrying (3) http://wiki.amcat.nl/api.php/api.php?action=query&meta=filerepoinfo&format=json 404
Unexpected HTTP status: 404 [object Object]
$ bin/mw-ocg-bundler -v -p amcatwiki --php-api http://wiki.amcat.nl/ --api-version=parsoid2 --api http://localhost:8142 Querying
[0%] Fetching wiki configuration
[0%] Fetching wiki configuration: http://wiki.amcat.nl/
[4%] Fetching wiki configuration:
[8%] Fetching wiki configuration: amcatwiki siteinfo
[17%] Fetching parsed articles
[17%] Fetching parsed articles: collection
[23%] Fetching parsed articles: amcatwiki:Querying [Parsoid, latest revision]
Retrying (1) http://parsoid-lb.eqiad.wikimedia.org/v2/wiki.amcat.nl/html/Querying 404
Retrying (2) http://parsoid-lb.eqiad.wikimedia.org/v2/wiki.amcat.nl/html/Querying 404
$ cat /etc/mediawiki/parsoid/settings.js
"use strict";
/*
* This is a sample configuration file.
*
* Copy this file to localsettings.js and edit that file to fit your needs.
*
* Also see the file ParserService.js for more information.
*/
$ bin/mw-ocg-bundler -v -p amcatwiki --php-api http://wiki.amcat.nl/ --a http://localhost:8142 Querying
[0%] Fetching wiki configuration
[0%] Fetching wiki configuration: http://wiki.amcat.nl/
[4%] Fetching wiki configuration:
[8%] Fetching wiki configuration: amcatwiki siteinfo
[17%] Fetching parsed articles
[17%] Fetching parsed articles: collection
[23%] Fetching parsed articles: amcatwiki:Querying [Parsoid, latest revision]
Retrying (1) http://wiki.amcat.nl/api/rest_v1/page/html/Querying 404
Retrying (2) http://wiki.amcat.nl/api/rest_v1/page/html/Querying 404
Boot successfully repaired.
Please write on a paper the following URL:
http://paste.ubuntu.com/12222673/
In case you still experience boot problem, indicate this URL to:
boot.repair@gmail.com or to your favorite support forum.
You can now reboot your computer.
from amcatclient.api import AmcatAPI
from amcat.nlp.naf import NAF_Article
from amcat.nlp.syntaxtree import _naf_to_rdf, SyntaxTree
from amcat.tools.pysoh.pysoh import SOHServer
from collections import defaultdict
import re
aid = 3958
api = AmcatAPI("http://amcat.vu.nl", "wva", "geheim!")
cluster label woorden
V28 asiel land,probleem,situatie,opvang,eigen,lang,asielzoeker,oplossing,los_op,beleid,vreemdeling
V18 banken bank,financieel,sector,risico,commissie,probleem,hoog,miljard,neem,nederlandsche bank,financien
V13 belasting hoog,betaal,inkomen,eigen,laag,bijdrage,extra,miljoen,oud,procent,euro
V46 bezuinig miljoen,miljard,euro,bezuiniging,begroting,bedrag,bezuinig,plan,extra,2013,nieuw
V21 buitenland land,nederlands,turkije,fractie,regering,egypte,eu,turks,collega,nieuw,moment
V8 Buitenland (verdragen) uitspraak,verdrag,internationaal,regering,europees,afspraak,nederlands,houd,recht,israel,loop
V27 commissies onderzoek,rapport,commissie,rol,conclusie,voer,zaak,discussie,moment,brief,ministerie
V4 Decentralisatie (bevoegdheden) gemeente,lokaal,burger,taak,overheid,provincie,verantwoordelijkheid,bestuur,probleem,rijk,gemeente_raad
V47 Decentralisatie (zorg) gemeente,nodig,wet,wmo,regel,budget,daarvoor,begeleiding,beleid,beteken,awbz