Alexandre Bulté abulte

@abulte
abulte / README.md
Created January 21, 2024 09:33
pandas-ai with mistral llms

Testing pandas-ai with Mistral-hosted LLMs

# patched pandas-ai working with llm module
pip install -e . ../Developer/pandas-ai
# patched llm-mistral that supports `.key` to pass creds
pip install git+https://github.com/abulte/llm-mistral.git@patch-1
echo "export MISTRAL_API_KEY=xxx" > .envrc
direnv allow
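
Once `direnv allow` has exported `MISTRAL_API_KEY`, a script can read the key from the environment before handing it to the model. The helper below is only a minimal sketch: `read_api_key` is a hypothetical name, not part of pandas-ai or llm-mistral, and it does nothing beyond checking that the variable is set.

```python
import os


def read_api_key(env_var="MISTRAL_API_KEY"):
    """Return the API key exported by direnv, or fail early if it is missing.

    Hypothetical helper for illustration; not part of pandas-ai or llm-mistral.
    """
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(
            f"{env_var} is not set; run `direnv allow` in this directory first"
        )
    return key
```

Failing early keeps a missing `.envrc` from surfacing later as an opaque authentication error.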
abulte / validata-geojson-stream.py
Last active May 13, 2022 14:58
validata-geojson-stream
import collections
import json
import json_stream
import jsonschema
from jsonschema import Draft7Validator
with open("schema_amenagements_cyclables.json") as ifile:
{
  "version": 8,
  "name": "Basic",
  "metadata": {
    "mapbox:autocomposite": true,
    "mapbox:type": "template"
  },
  "sources": {
    "mapbox": {
      "type": "vector",
abulte / big-sur-python.md
Last active February 8, 2021 18:26
Big Sur python mayhem

pyenv

Install python via pyenv with bzip2 support:

xcode-select --install
brew install bzip2
export LDFLAGS="-L/usr/local/opt/bzip2/lib -L/usr/local/opt/zlib/lib"
export CPPFLAGS="-I/usr/local/opt/bzip2/include -I/usr/local/opt/zlib/include"
pyenv install -f 3.8.6
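
A quick way to confirm that the rebuilt interpreter picked up the compression libraries is to try importing the corresponding standard-library modules. This is a minimal sketch under that assumption; `missing_modules` is a hypothetical helper name, and it should be run with the freshly installed pyenv interpreter.

```python
import importlib


def missing_modules(names=("bz2", "zlib")):
    """Return the names of modules that fail to import in this interpreter."""
    missing = []
    for name in names:
        try:
            importlib.import_module(name)
        except ImportError:
            # A Python built without the C library raises ImportError here.
            missing.append(name)
    return missing


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("bz2 and zlib are available")
```

If `bz2` shows up as missing, the `LDFLAGS` and `CPPFLAGS` exports were probably not visible to the `pyenv install` build.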
id,title,url,description,organization
5e174be4634f414cd8a244f1,Criteria for allocating State grants paid to local authorities,http://www.data.gouv.fr/fr/datasets/criteres-de-repartition-des-dotations-versees-par-letat-aux-collectivites-territoriales/,This dataset gives access to the main physical and financial criteria used to allocate the national equalization funds and the State grants to local authorities. These main physical and financial criteria are available for all local authorities in CSV format.,Direction générale des collectivités locales
// 20200116113947
// http://localhost:5000/?q=entreprises
{
  "data": [
    {
      "_id": "5b7ffc618b4c4169d30727e0",
      "description": "\n\n\n- To subscribe to our newsletter **Sirene open data actualités**, follow this link: https://insee.fr/fr/information/1405555\n- To read our newsletters **Sirene open data actualités**, follow this link: https://insee.fr/fr",
      "hits_rank": 4357.4544033408165,
      "nb_hits": 5919,
abulte / config.yml
Last active May 7, 2019 09:31
cachet-monitor
api:
  # cachet url
  url: https://status.etalab.studio/api/v1
  # cachet api token
  token: vo6DRCtqGbtjiHsycM58
  insecure: false
# https://golang.org/src/time/format.go#L57
date_format: 02/01/2006 15:04:05 MST
monitors:
  - name: datagouvfr-website
abulte / python-csv-fail-parsing-csv-doublequote.py
Created October 3, 2018 15:51
python-csv-fail-parsing-csv-doublequote
import re


def _guess_quote_and_delimiter(data, delimiters=None):
    """
    Looks for text enclosed between two identical quotes
    (the probable quotechar) which are preceded and followed
    by the same character (the probable delimiter).
    For example:
                     ,'some text',
    The quote with the most wins, same with the delimiter.
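
The heuristic described in this docstring is the one the standard library's `csv.Sniffer` applies, so its behavior can be observed without the truncated function body. The sketch below is an illustration only: the sample data and the wrapper name `sniff_quote_and_delimiter` are assumptions, and it does not reproduce the parsing failure this gist is named after.

```python
import csv


def sniff_quote_and_delimiter(sample):
    """Ask csv.Sniffer for the probable quote character and delimiter."""
    dialect = csv.Sniffer().sniff(sample)
    return dialect.quotechar, dialect.delimiter


# Hypothetical sample: single-quoted fields separated by commas,
# matching the ",'some text'," pattern from the docstring.
sample = "'id','some text','x'\n'2','more text','y'\n"
quotechar, delimiter = sniff_quote_and_delimiter(sample)
print(repr(quotechar), repr(delimiter))
```

Each quoted field that sits between two identical characters counts as one vote for that quote and that delimiter, which is why inputs with few quoted fields can be guessed wrongly.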
## The order of the types in this file is important.
## This file exposes an array that defines the order in which the types will be matched.
##
## Always ensure that the extensions (and related) are properly lowercased.
version: '1'
types:
  - name: gpkg
    extensions: