Skip to content

Instantly share code, notes, and snippets.

View pjox's full-sized avatar
Drinking coffee

Pedro Ortiz Suarez pjox

Drinking coffee
View GitHub Profile
@pjox
pjox / saturn.pretty.wat
Last active July 16, 2024 22:48
Common Crawl format example for https://en.wikipedia.org/wiki/Saturn
WARC/1.0
warc-date: 2023-10-07T23:46:25Z
warc-refers-to: <urn:uuid:49a7d90e-e82c-4229-9218-e22d7e31e2ef>
warc-target-uri: https://en.wikipedia.org/wiki/Saturn
content-type: application/json
content-length: 1435
warc-type: metadata
warc-record-id: <urn:uuid:9f86fbf2-3587-42e3-ae4d-dd4278cb09dc>
{
@pjox
pjox / countDocs.go
Created June 7, 2022 13:38
Count the Number of Documents in OSCAR 21.09 for a given language
package main
import (
"bufio"
"compress/gzip"
"errors"
"fmt"
"log"
"os"
"path/filepath"
@pjox
pjox / dedup.go
Created August 13, 2020 14:01
The deduplication script for OSCAR
package main
import (
"bufio"
"fmt"
"os"
"github.com/cespare/xxhash"
)
@pjox
pjox / installation.txt
Created August 17, 2018 18:50
Delft installation
(delft) portizsu@traces4:/data/workspace/portizsu/delft$ pip install -r requirements-gpu.txt
Collecting keras==2.1.5 (from -r requirements-gpu.txt (line 1))
Downloading https://files.pythonhosted.org/packages/ba/65/e4aff762b8696ec0626a6654b1e73b396fcc8b7cc6b98d78a1bc53b85b48/Keras-2.1.5-py2.py3-none-any.whl (334kB)
100% |████████████████████████████████| 337kB 15.6MB/s
Collecting numpy==1.14.0 (from -r requirements-gpu.txt (line 2))
Downloading https://files.pythonhosted.org/packages/dc/ac/5c270dffb864f23315e9c1f9e0a0b300c797b3c170666c031c4de42aacae/numpy-1.14.0-cp36-cp36m-manylinux1_x86_64.whl (17.2MB)
100% |████████████████████████████████| 17.2MB 3.6MB/s
Collecting pandas==0.22.0 (from -r requirements-gpu.txt (line 3))
Downloading https://files.pythonhosted.org/packages/da/c6/0936bc5814b429fddb5d6252566fe73a3e40372e6ceaf87de3dec1326f28/pandas-0.22.0-cp36-cp36m-manylinux1_x86_64.whl (26.2MB)
100% |████████████████████████████████| 26.3MB 2.4MB/s

Keybase proof

I hereby claim:

  • I am pjox on github.
  • I am pjox (https://keybase.io/pjox) on keybase.
  • I have a public key ASCrWAmI67DSvg1PtP7SFUEnxnkznjj3oB-ky6H4EyhXMwo

To claim this, I am signing this object:

@pjox
pjox / Summy.cpp
Created February 24, 2011 02:57
suma 2 números muy grandes
#include <iostream>
#include <sstream>
using namespace std;
int main() {
string s1, s2;
cout << "primer numero:" << endl;
cin >> s1;
cout << "segundo numero:" << endl;
cin >> s2;