
Peng Shi Impavidity

  • University of Waterloo
  • ON, Canada
#include <set>
#include <map>
#include "tensorflow/core/framework/op.h"
#include "tensorflow/core/framework/shape_inference.h"
#include "tensorflow/core/framework/op_kernel.h"
using namespace tensorflow;

Nesoi

Settings

The training data for our system consists of question-answer pairs only. Reasoning paths and supporting facts, whether over text or over a graph, are latent. The goal of this project is to develop a robust and interpretable system.

  • Input: Question
  • Output: Answer, Reasoning path or Supporting Facts
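The input/output contract above can be sketched as two small record types. This is only an illustration: the class and field names (`QAExample`, `Prediction`, `reasoning_path`, `supporting_facts`) are hypothetical and not taken from the project code.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class QAExample:
    """A training instance: only the question and its answer are observed."""
    question: str
    answer: str


@dataclass
class Prediction:
    """System output: the answer plus the (latent at training time) evidence."""
    answer: str
    # Hypothetical representation: one string per hop or per supporting fact.
    reasoning_path: List[str] = field(default_factory=list)
    supporting_facts: List[str] = field(default_factory=list)
```

Keeping the evidence fields out of `QAExample` mirrors the setting described above, where reasoning paths and supporting facts are never given as supervision.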

Dataset Setup

sudo docker run --name dbpedia-virtuoso -p 8890:8890 -p 1111:1111 -v /data/indexes/docker-dbpedia-data:/data/indexes/docker-virtuoso/db/dumps -v /data/indexes/DBpedia-docker/virtuoso/virtuoso.ini:/data/virtuoso.ini -d tenforce/virtuoso
docker exec -it dbpedia-virtuoso bash
isql-v -U dba
ld_add('/data/indexes/docker-virtuoso/db/dumps/dbpedia/classes/dbpedia_2016-10.owl', 'http://dbpedia.org/resource/classes#');
ld_dir('/data/indexes/docker-virtuoso/db/dumps/dbpedia/core-i18n/en', '*.ttl', 'http://dbpedia.org/');
rdf_loader_run();
438336419
521414254
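Once rdf_loader_run() has finished, one way to sanity-check the load is to count triples through the SPARQL endpoint. The sketch below assumes the endpoint is reachable at http://localhost:8890/sparql (the port mapped in the docker command above); the function names are illustrative.

```python
import json
import urllib.parse
import urllib.request

# Assumption: Virtuoso's SPARQL endpoint on the port published by the container.
ENDPOINT = "http://localhost:8890/sparql"

COUNT_QUERY = "SELECT (COUNT(*) AS ?n) WHERE { ?s ?p ?o }"


def build_count_url(endpoint):
    """Build a GET URL that asks for the triple count as SPARQL JSON results."""
    params = urllib.parse.urlencode({
        "query": COUNT_QUERY,
        "format": "application/sparql-results+json",
    })
    return endpoint + "?" + params


def count_triples(endpoint=ENDPOINT):
    """Send the count query and return the number of triples as an int."""
    with urllib.request.urlopen(build_count_url(endpoint)) as resp:
        data = json.load(resp)
    return int(data["results"]["bindings"][0]["n"]["value"])
```

Calling `count_triples()` requires the container to be running; a full count over a large store can take a while.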
package io.anserini.util;
import io.anserini.collection.Collection;
import io.anserini.document.SourceDocument;
import java.io.FileOutputStream;
import java.io.ObjectOutputStream;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import org.apache.commons.lang3.time.DurationFormatUtils;
import org.apache.logging.log4j.LogManager;
package io.anserini.util;
import io.anserini.collection.Collection;
import io.anserini.document.SourceDocument;
import io.anserini.index.transform.StringTransform;
import java.io.BufferedReader;
import java.io.BufferedWriter;
import java.io.FileInputStream;
import java.io.FileReader;
import java.io.FileWriter;
# apache-jena-fuseki-3.6.0/run/configuration/all_services.ttl
@prefix : <http://localhost/jena_example/#> .
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix tdb: <http://jena.hpl.hp.com/2008/tdb#> .
#@prefix tdb2: <http://jena.apache.org/2016/tdb#> .
@prefix ja: <http://jena.hpl.hp.com/2005/11/Assembler#> .
@prefix text: <http://jena.apache.org/text#> .
@prefix fuseki: <http://jena.apache.org/fuseki#> .
import json

def process_dataset(path):
    # Read a JSON Lines file: one JSON-encoded example per line.
    examples = []
    with open(path) as f:
        for line in f:
            example = json.loads(line)
            examples.append(example)
    print("Reading {} examples. ".format(len(examples)))
# apache-jena-fuseki-3.6.0/run/configuration/all_services.ttl
@prefix : <http://localhost/jena_example/#> .
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix tdb: <http://jena.hpl.hp.com/2008/tdb#> .
#@prefix tdb2: <http://jena.apache.org/2016/tdb#> .
@prefix ja: <http://jena.hpl.hp.com/2005/11/Assembler#> .
@prefix text: <http://jena.apache.org/text#> .
@prefix fuseki: <http://jena.apache.org/fuseki#> .

Data Loading

nohup apache-jena-3.6.0/bin/tdbloader2 --loc /tuna1/indexes/d-wikidata /tuna1/collections/wikidata/latest-all.ttl.gz > load_wikidata.log &

Service

./fuseki-server --loc ../d-wikidata/ /wikidata
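With the server running, the dataset can be queried over the SPARQL protocol. The following is a minimal sketch, assuming Fuseki listens on its default port 3030 and exposes the dataset's query service at /wikidata/query; the sample query and the entity Q42 are only illustrative.

```python
import json
import urllib.parse
import urllib.request

# Assumptions: default Fuseki port (3030) and the "query" service of /wikidata.
FUSEKI_QUERY_URL = "http://localhost:3030/wikidata/query"

# Illustrative query: fetch one English label for a sample entity.
SAMPLE_QUERY = """
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
SELECT ?label WHERE {
  <http://www.wikidata.org/entity/Q42> rdfs:label ?label .
  FILTER(LANG(?label) = "en")
} LIMIT 1
"""


def make_query_request(url, query):
    """Build a form-encoded POST request that asks for SPARQL JSON results."""
    body = urllib.parse.urlencode({"query": query}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Accept": "application/sparql-results+json"},
    )


def run_query(url, query):
    """Execute the query and return the list of result bindings."""
    with urllib.request.urlopen(make_query_request(url, query)) as resp:
        return json.load(resp)["results"]["bindings"]
```

`run_query(FUSEKI_QUERY_URL, SAMPLE_QUERY)` needs the server started as shown above. POST is used here so that longer queries do not run into URL length limits.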
#from annoy import AnnoyIndex
import math
import json
import sys
import os
sys.path.append(os.getcwd())  # make modules in the working directory importable
import pickle as pkl
import random
import os.path