#!/usr/bin/python
# -*- coding: utf-8 -*-
"""
Iterate over an S3 bucket containing JSON log files,
get each file, massage the JSON a little, and upload it to Elasticsearch.
"""
@lambdamusic
lambdamusic / split_json.sh
Created February 27, 2019 11:37
Split a JSONL file with 10,000+ lines into one file per line and rename the pieces
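#!/bin/bash
# Split a JSONL file into one file per line: Part1.json, Part2.json, ...
FILENAME=records_111901_00000000.jsonl
# -l 1: one line per piece; -a 5: five-letter suffixes, room for 26^5 pieces
split -a 5 -l 1 "$FILENAME" temp
n=1
for f in temp*
do
    # Append the piece to its numbered output file, then drop the temp file
    cat "$f" >> "Part${n}.json"
    rm "$f"
    ((n++))
done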
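The same one-record-per-file split can be sketched in Python. This is a minimal sketch, not part of the gist: the function name `split_jsonl` and its parameters are hypothetical, and it assumes UTF-8 input that is read line by line.

```python
from pathlib import Path


def split_jsonl(source, out_dir, prefix="Part"):
    """Write each line of `source` to its own numbered .json file.

    Hypothetical helper mirroring the shell script: Part1.json, Part2.json, ...
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    with open(source, encoding="utf-8") as handle:
        # enumerate from 1 so file numbering matches the shell counter n
        for n, line in enumerate(handle, start=1):
            target = out_dir / f"{prefix}{n}.json"
            target.write_text(line, encoding="utf-8")
            written.append(target)
    return written
```

Unlike the shell version, this overwrites an existing `PartN.json` instead of appending to it, and it needs no temporary files.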
@lambdamusic
lambdamusic / splitter.sh
Created September 24, 2018 19:29 — forked from maesa/splitter.sh
Bash script that uses split to break a large CSV file into pieces of 100 lines each, repeating the header in every piece. Outputs Part1, Part2, ...
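#!/bin/bash
FILENAME=filename_here.csv
# Remember the header row so it can be repeated in every piece
HDR=$(head -1 "$FILENAME")
split -l 100 "$FILENAME" xyz
n=1
for f in xyz*
do
    # The first piece already starts with the header
    if [ "$n" -gt 1 ]; then
        echo "$HDR" > "Part${n}.csv"
    fi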
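The header-preserving split can also be sketched in Python. This is a minimal sketch under stated assumptions: `split_csv` and `rows_per_part` are hypothetical names, and the split is line-based like the shell version, so a quoted field that spans several lines would be cut in two. Here every piece holds the header plus up to `rows_per_part` data rows, whereas the shell script's first piece counts the header among its 100 lines.

```python
from itertools import islice
from pathlib import Path


def split_csv(source, out_dir, rows_per_part=100):
    """Split a CSV into Part1.csv, Part2.csv, ..., each starting with the header."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    parts = []
    with open(source, encoding="utf-8", newline="") as handle:
        header = handle.readline()  # first line is assumed to be the header
        n = 0
        while True:
            # Take the next chunk of data rows; an empty chunk means we are done
            rows = list(islice(handle, rows_per_part))
            if not rows:
                break
            n += 1
            target = out_dir / f"Part{n}.csv"
            with open(target, "w", encoding="utf-8", newline="") as out:
                out.write(header)
                out.writelines(rows)
            parts.append(target)
    return parts
```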
@lambdamusic
lambdamusic / scigraph_context.json
Last active April 4, 2024 20:37
SciGraph context (preview) #jsonld
# source: https://github.com/springernature/scigraph/blob/master/2018Q3/context/scigraph.json
{
"@context": {
"@vocab": "http://schema.org/",
"@base": null,
"sg": "http://scigraph.springernature.com/id/",
"sgo": "http://scigraph.springernature.com/ontologies/core/",
"rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
"rdfs": "http://www.w3.org/2000/01/rdf-schema#",
@lambdamusic
lambdamusic / grant.jsonld
Created July 19, 2018 10:27
SciGraph Grants schema.org JSONLD
{
"@context": {
"@vocab": "http://schema.org/",
"@base": null,
"sg.articles": "http://scigraph.springernature.com/things/articles/",
"sg.persons": "http://scigraph.springernature.com/things/persons/",
"sg.grants": "http://scigraph.springernature.com/things/grants/",
"sg": "http://scigraph.springernature.com/ontologies/core/"
},
"@id": "sg.grants:grant.2691278",
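The `@id` above is a compact IRI: the part before the colon names a prefix defined in `@context`. A minimal Python sketch of that expansion follows. The helper `expand_compact_iri` is hypothetical and handles only string-valued prefixes; it is not the full JSON-LD expansion algorithm (it ignores `@vocab`, `@base`, and object-valued term definitions).

```python
def expand_compact_iri(value, context):
    """Expand 'prefix:suffix' using string-valued prefix entries of a context."""
    prefix, sep, suffix = value.partition(":")
    if not sep or suffix.startswith("//"):
        # No colon at all, or already an absolute IRI such as http://...
        return value
    base = context.get(prefix)
    if isinstance(base, str):
        return base + suffix
    return value  # unknown prefix: leave the value untouched


# Prefixes copied from the context shown above
CONTEXT = {
    "@vocab": "http://schema.org/",
    "sg.articles": "http://scigraph.springernature.com/things/articles/",
    "sg.persons": "http://scigraph.springernature.com/things/persons/",
    "sg.grants": "http://scigraph.springernature.com/things/grants/",
    "sg": "http://scigraph.springernature.com/ontologies/core/",
}

print(expand_compact_iri("sg.grants:grant.2691278", CONTEXT))
# prints http://scigraph.springernature.com/things/grants/grant.2691278
```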
# Obtain the label of a given class (:class1).
SELECT DISTINCT ?c (STR(?l) AS ?lb)
WHERE {
?c a :class1 ;
<http://www.w3.org/2000/01/rdf-schema#label> ?l .
}
# Obtain a list of classes.
SELECT DISTINCT ?c
WHERE {
@lambdamusic
lambdamusic / querymanager.py
Created February 22, 2018 16:03
Python Index Slicing implementation
def __getitem__(self, key):
"""
Basic slicing support up to 100 (QueryManager.BATCH_DEFAULT_SIZE)
https://docs.python.org/3/reference/datamodel.html#object.__getitem__
This is just a quick way to get sample items, as it works only for the initial batch of results.
Use the next() method to go through the whole result set.
"""
if isinstance( key, slice ) :
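Since the preview stops at the `isinstance` check, here is a minimal, self-contained sketch of how slice and integer keys might be handled over an initial batch. The class `BatchResults` and its `_batch` attribute are hypothetical stand-ins, not the gist's `QueryManager`; only the 100-item batch size is taken from the docstring.

```python
class BatchResults:
    """Hypothetical stand-in for a result object that holds only a first batch."""

    BATCH_DEFAULT_SIZE = 100

    def __init__(self, items):
        # Keep only the initial batch, as the docstring above describes
        self._batch = list(items)[: self.BATCH_DEFAULT_SIZE]

    def __getitem__(self, key):
        if isinstance(key, slice):
            # slice.indices clamps start/stop/step to the batch length
            start, stop, step = key.indices(len(self._batch))
            return [self._batch[i] for i in range(start, stop, step)]
        if isinstance(key, int):
            if key < 0:
                key += len(self._batch)  # support negative indices
            if not 0 <= key < len(self._batch):
                raise IndexError("index outside the initial batch")
            return self._batch[key]
        raise TypeError(
            f"indices must be integers or slices, not {type(key).__name__}"
        )
```

With `results = BatchResults(range(250))`, `results[:3]` covers the first three items, while `results[95:200]` is silently clamped to the five items left in the batch.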
@lambdamusic
lambdamusic / Django 1.8 static tips
Last active February 1, 2018 13:00
Django 1.8 static tips
# usual way
```
<link href="{% static 'libs/bootswatch3.2/{{ THEME }}/bootstrap.min.css' %}" rel="stylesheet">
```
# more manual but works if you have variables in it
```
@lambdamusic
lambdamusic / ontospy_example.py
Created May 16, 2017 11:45
ontospy example: extracting labels and descriptions
In [1]: import ontospy
In [2]: o = ontospy.Ontospy("foaf.rdf")
In [4]: c = o.classes[0]
In [5]: c
Out[5]: <Class *http://www.w3.org/2000/10/swap/pim/contact#Person*>
In [6]: c.bestLabel()
@lambdamusic
lambdamusic / mplayp.scm
Created January 6, 2017 16:29
Play pattern for midi instruments
;;;;;;;;;;;;;;; On Beat ;;;;;;;;;;;;;;;;;;;;;
(define-macro (onbeat? b of t . f)
(if (null? f)
`(if (= (modulo beat ,of) (- ,b 1))
,t)
`(if (= (modulo beat ,of) (- ,b 1))