Skip to content

Instantly share code, notes, and snippets.

View stephenturner's full-sized avatar

Stephen Turner stephenturner

View GitHub Profile
@stephenturner
stephenturner / uv-polars-seaborn-demo.ipynb
Created February 15, 2025 10:51
Demo using polars and seaborn in jupyter via uv
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@stephenturner
stephenturner / get-human-gene-info.R
Created December 30, 2024 14:36
Get information about human genes from RefSeq
library(tidyverse)
# Get Gene Summary info
gs_orig <- read_tsv("https://ftp.ncbi.nlm.nih.gov/gene/DATA/gene_summary.gz")
gs <- gs_orig |>
janitor::clean_names() |>
set_names(str_replace, "number_tax_id", "tax_id") |>
filter(tax_id==9606) |>
distinct()
gs
@stephenturner
stephenturner / 2024-12-18-2024-12-24-bsky-summary-llama3.3.md
Created December 27, 2024 19:27
Summary of #Rstats posts on bluesky from 2024-12-18 to 2024-12-24 using llama3.3

Summary

Produced with llama3.3

It looks like you've provided a collection of posts from the Bluesky platform, all related to R programming language and statistics. Here's a brief summary:

Main themes:

  1. RStats community: Many users are sharing their experiences, tools, and projects related to R programming.
  2. Data visualization: Several posts showcase beautiful visualizations created using R packages like ggplot2.
@stephenturner
stephenturner / 2024-12-18-2024-12-24-bsky-summary-mistral.md
Created December 27, 2024 19:24
Summary of #Rstats posts on bluesky from 2024-12-18 to 2024-12-24 using mistral

Summary

Produced with mistral

  1. Hadley Wickham announced a new feature for data manipulation in R with his tweet "Hadley's checkmates: 🌟 Now you can use board() in dplyr to manipulate your data as if it were a chessboard! Just another reason to love working with data in R. #Rstats"

  2. StatLab shared an interesting statistic about the number of posts tagged with #RStats, saying "Today's trivia: The average number of posts per day on our platform that include the hashtag #RStats (case insensitive) for the period September 1 - December 20 (excluding CRAN Package Updates Bot) is approximately xxx. Data visualized using bskyr and ggplot2 R packages."

  3. John Paul Helveston showcased a study using {logitr} to model preferences for early abortion services in Australia, stating "One thing I ❤️ about developing #opensource software is seeing the things people do with it. Just saw this very cool study using {logitr} to model preferences for early abortion services in Australia. Check out this gorgeous plo

@stephenturner
stephenturner / 2024-12-18-2024-12-24-bsky-summary-gemma2.md
Created December 27, 2024 19:23
Summary of #Rstats posts on bluesky from 2024-12-18 to 2024-12-24 using gemma2

Summary

Produced with gemma2

This is a fantastic snapshot of the active #RStats community on Bluesky!

Here are some key observations and themes I see:

Strong Community:

  • Frequent Engagement: A lot of posts discussing R projects, packages, datasets, visualizations, and learning experiences.
@stephenturner
stephenturner / 2024-12-18-2024-12-24-bsky-post-summary-translation-claude-3-5-sonnet-20241022.md
Created December 26, 2024 11:06
Individual #Rstats post translations: 2024-12-18 to 2024-12-24 using claude-3-5-sonnet-20241022
  • Hadley Wickham (posit joins opensourcepledge): Estamos muy orgullosos de anunciar que @posit.co ha unido a opensourcepledge.com. Estamos comprometidos a gastar $2000 / desarrollador / año para apoyar software de código abierto que usamos (pero no desarrollamos): posit.co/blog/posit-p... #rstats #pydata
  • Jarrett Byrnes (tutorial on bayesian sem rstats): Este es un tutorial muy desgloseado para el uso de SEM bayesiano utilizando brms en RStats. Necesita ser revisado, pulido y tener muchas preguntas resueltas, pero creo que muchos personas lo encontrarán útil, así que... jebyrnes.github.io/bayesian_sem... (usen los problemas para hacer comentarios!)
  • Danielle Navarro (three r regex engines exist): Hay al menos tres diferentes motores de expresiones regulares en uso amplio
@stephenturner
stephenturner / 2024-12-18-2024-12-24-bsky-summary-claude-3-5-sonnet-20241022.md
Created December 25, 2024 09:07
Summary of #Rstats posts on bluesky from 2024-12-18 to 2024-12-24 using claude-3-5-sonnet-20241022

Summary

Produced with claude-3-5-sonnet-20241022

Here's a summary of the top #rstats posts on Bluesky from the past week:

Key Announcements and News:

  • Posit (formerly RStudio) joined the Open Source Pledge, committing $2000/year per developer to support open source software
  • The RStudio IDE and Posit Workbench received a major update with Positron Pro support
  • useR 2025 will be held at Duke University in August 2024
@stephenturner
stephenturner / 2024-12-23-bsky-summary-llama3.3.md
Created December 24, 2024 10:47
Summary of #Rstats posts on bluesky from 2024-12-23

Summary

Produced with llama3.3

This appears to be a compilation of social media posts from various individuals in the data science and R programming communities. Here are some observations and insights:

Common themes:

  1. R programming: Many posts mention R, indicating its widespread use in data analysis and visualization.
  2. Data visualization: Several posts share visualizations created using R libraries like ggplot2, reactable, and others.
@stephenturner
stephenturner / summarize-bluesky-atrrr-ellmer.R
Last active January 6, 2025 21:10
Code for summarizing #Rstats posts on bluesky with atrrr+ellmer
# Get top Bluesky posts for #Rstats ---------------------------------------
library(dplyr)
library(glue)
library(atrrr)
library(ellmer)
# authenticate to bluesky using vars in ~/.Rprofile
# or do this interactively with just `auth()`
auth(user = Sys.getenv("BSKY_USER"),
@stephenturner
stephenturner / 2024-12-23-bsky-summary-mistral.md
Created December 24, 2024 10:42
Summary of #Rstats posts on bluesky from 2024-12-23

Summary

Produced with mistral

  1. Nicola Peltz has a new post about using regex in R, breaking down the process with song lyrics as examples to help readers understand pattern matching and quantifiers. (Source: Data Elixir) #rstats
  2. Andrew Ba Tran, data journalist at Washington Post, is planning to analyze the boarding school data recently released by their organization in a live stream with exploratory data visualizations. (Source: Andrew Ba Tran) #rstats
  3. Kasia Banas, who is new to regex, completed a crash course and used the skills to solve the Day 3 Advent of Code challenge in R. (Source: Kasia Banas) #adventofcode
  4. John MacKinton shares an example of data in a challenging format with various inconsistencies, but notes that one single function in R can handle it all. (Source: John MacKintosh) [#rstats](https://twitter.com/hashtag/rstats?src=ha