@bzz
bzz / oss-llms.md
Last active October 11, 2023 14:31
Commercial-friendly, permissively licensed Open Source Large Language Models
| Model | Arch | License | Params | Seq Len | FP Format | VRAM | Infer Lib | Tokenizer | Comments | Other flavours |
bzz / ML4SE at ICML 2023.md
Last active May 16, 2023 08:53
Machine Learning for Software Engineering papers at ICML 2023
{
"ai_tutor": {
"Author": "JushBJJ",
"name": "Mr. Ranedeer",
"version": "2.3.6",
"features": {
"personalization": {
"depth": {
"description": "This is the depth of the content the student wants to learn. A low depth will cover the basics, and generalizations while a high depth will cover the specifics, details, unfamiliar, complex, and side cases. The lowest depth level is 1, and the highest is 10.",
"depth_levels": {
bzz / 1_README.md
Last active May 3, 2023 21:51
Embeddings for ML4Code papers from https://ml4code.github.io/ in TF Projector format
bzz / paper-abstracts-27.04.23-emb.tsv
Last active April 27, 2023 21:05
Embeddings for ML for Code papers from https://ml4code.github.io/
-4.888096824288368225e-02 2.042284701019525528e-03 -6.076217815279960632e-02 1.794655248522758484e-02 3.518840670585632324e-02 -9.946228563785552979e-02 -4.138710349798202515e-02 2.458935976028442383e-02 -9.456774592399597168e-02 5.039725825190544128e-02 -3.929508477449417114e-02 -7.642178796231746674e-03 5.321704596281051636e-02 3.194081038236618042e-02 6.014326214790344238e-02 8.431854099035263062e-02 -4.898108076304197311e-03 -2.944599604234099388e-03 -1.544282585382461548e-02 -9.158698469400405884e-02 2.330877445638179779e-02 3.664040938019752502e-02 -3.887851536273956299e-02 4.658725485205650330e-02 1.236499520018696785e-03 -2.896820753812789917e-02 -5.904728174209594727e-02 -3.831689059734344482e-02 3.862988203763961792e-02 4.909663111902773380e-04 -9.784634225070476532e-03 1.093933805823326111e-01 -1.426374260336160660e-02 1.259497702121734619e-01 5.247422959655523300e-03 9.101443737745285034e-02 1.028318796306848526e-02 3.331246599555015564e-02 1.219783164560794830e-02 1.863633096218109131e-02 -7.5334
bzz / controversy-tech-quotes.md
Last active October 7, 2021 21:04
Controversial ideas in Tech that end up working well

Controversy on good ideas in technology

As a programmer, it is your job to put yourself out of business. What you do today can be automated tomorrow.

Douglas McIlroy

1940s computers: digital vs. analog

Vannevar Bush was a scientist, a participant in the Manhattan Project, a member of the AT&T board of directors, and the inventor of the Differential Analyzer, among other things. Although he is the conceptual father of the “personal computer”, the Memex described in his essay As We May Think (The Atlantic), he was famously against digital computers: he did not believe they could be built reliably and was in favour of analog computation instead.

bzz / ML4SE at ICLR 2021.md
Last active November 18, 2021 07:39
Machine Learning for Software Engineering papers at ICLR 2021

Accepted Papers

  1. On the Bottleneck of Graph Neural Networks and its Practical Implications

    Uri Alon, Eran Yahav. poster

  2. GraphCodeBERT: Pre-training Code Representations with Data Flow

    Daya Guo, Shuo Ren, Shuai Lu, Zhangyin Feng, Duyu Tang, Shujie Liu, Long Zhou, Nan Duan, Alexey Svyatkovskiy, Shengyu Fu, Michele Tufano, Shao Kun Deng, Colin Clement, Dawn Drain, Neel Sundaresan, Jian Yin, Daxin Jiang, Ming Zhou. poster

  3. BUSTLE: Bottom-Up Program Synthesis Through Learning-Guided Exploration

    Augustus Odena, Kensen Shi, David Bieber, Rishabh Singh, Charles Sutton, Hanjun Dai. spotlight

bzz / Paul Graham essays.md
Last active January 7, 2023 18:59
Paul Graham essays 04.2021