Navigation Menu

Skip to content

Instantly share code, notes, and snippets.

Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save MilesCranmer/0491b25dfc9d7e8165a12e3db8b15965 to your computer and use it in GitHub Desktop.
Save MilesCranmer/0491b25dfc9d7e8165a12e3db8b15965 to your computer and use it in GitHub Desktop.
My Writings Tracker with Git: A bash script for tracking my writings in plain tex files, calculate stats like word count, hashtags and new files, store stats to csv and commit to git.
#!/bin/bash
##################################
#
# THE ARCHIVE TRACKER
#
# REF: https://gist.github.com/markwk/c85a8a72bc8c03d0f510262bb5219a34/
#
# INTRODUCTION:
# Daily script to navigate to a directory of plain text files,
# add files to git repo, calculate key stats, store stats to csv
# commit to git with message and do a local mac push notification.
#
# Designed to work with note files from The Archive (https://zettelkasten.de/the-archive/)
# but should work with any directory of plain text files.
#
# by Mark Koester
# github.com/markwk
# markwkoester@gmail.com
#
##################################
# SETUP
# [If haven't already, install git locally]
#
# 1. Either make your current files directory git repo
# Or create a cope directory of files and make it a git repo
#
# 2. Copy this script and create file into a local directory as archive-daily-git-commit.bash
# 3. Make file executable by setting permissions with commmand:
# $ chmod +x archive-daily-git-commit.bash
#
# 4. Configure Directory references below.
# 5. Edit cron to run daily by running command:
# $ crontab -e
# add line like this which runs at 1am daily:
# 00 1 * * * /path/to/bash/writings-tracking-with-git.bash
#
##################################
# CONFIGURATION
#
# Uncomment and set your target directories.
#
ARCHIVE_DIR="/path/to/directory/of/plain/text/files/"
COPIED_DIR="path/to/directory/of/repo/"
DATA_FILE="path/to//writing-notes-stats.csv" # be sure to create this file first with headers
#
##################################
CURRENTDATETIME=`date +"%Y-%m-%d %T"`
CURRENTDATE=`date +"%Y-%m-%d"`
YESTERDAY=`date -v-1d +%F`
# Copy files from current directory into repo
# Uncomment to remove copying
cd "$ARCHIVE_DIR"
cp -f * $COPIED_DIR
# Navigate to target directory and start git staging
cd $COPIED_DIR
git add .
# Run our analysis and post to some log messages.
echo "Writing Stats Analysis for ${CURRENTDATETIME}"
# File Counts
total_files="$(ls -1q * | wc -l | tr -d '[:space:]')"
files_changed="$(git status | wc -l | tr -d '[:space:]')"
files_added="$(git status | grep 'new file' | wc -l | tr -d '[:space:]')"
files_modified="$(git status | grep 'modified' | wc -l | tr -d '[:space:]')"
files_deleted="$(git status | grep 'deleted' | wc -l | tr -d '[:space:]')"
files_renamed="$(git status | grep 'renamed' | wc -l | tr -d '[:space:]')"
echo "Files: total files: $total_files, total changed: $files_changed, added " $files_added, "modified " $files_modified, "deleted " $files_deleted, "renamed" $files_renamed
# Word Counts | Credit: https://gist.github.com/MilesCranmer/5c7d86c8740219355d2dfdb184910711
words_added=$(git diff --cached --word-diff=porcelain |grep -e"^+[^+]"|wc -w|xargs)
words_deleted=$(git diff --cached --word-diff=porcelain |grep -e"^-[^-]"|wc -w|xargs)
words_duplicated=$(git diff --cached |grep -e"^+[^+]" -e"^-[^-]"|sed -e's/.//'|sort|uniq -d|wc -w|xargs)
echo "Words: added " $words_added, "deleted " $words_deleted, "duplicated" $words_duplicated
# hashtag counts (note: excluding hashed tagged words like #1234)
hashtags_added=$(git diff --cached --word-diff=porcelain | grep -e"^+" | grep -o '#[a-zA-Z]\+'|wc -w|xargs)
hashtags_deleted=$(git diff --cached --word-diff=porcelain | grep -e"^-" | grep -o '#[a-zA-Z]\+'|wc -w|xargs)
echo "Hashtags: added " $hashtags_added, "deleted " $hashtags_deleted
# Count of citations references in special format like #123.
# Used in the case of a reference manager like Bookends or Zotero.
refs_added=$(git diff --cached --word-diff=porcelain | grep -e"^+" | grep -o '#[0-9_]\+'|wc -w|xargs)
refs_deleted=$(git diff --cached --word-diff=porcelain | grep -e"^-" | grep -o '#[0-9_]\+'|wc -w|xargs)
echo "References: added " $refs_added, "deleted " $refs_deleted
# FUTURE TODO: Loop through all modified files and get list of hashtag and count on active files
# for x in "$(git status -s | grep -o '".*"' | tr -d '"' )"; do
#name=$($x | tr -d '"')
#file="$ARCHIVE_DIR$x"
#x+=
# echo "$x"
#open ""$x"" #| grep -o '#[a-zA-Z]\+'
#done
# Save stats as new line with date to local csv
echo ${YESTERDAY}, ${CURRENTDATETIME}, $total_files, $files_changed, $files_added, $files_modified, $files_deleted, $files_renamed, $words_added, $words_deleted, $words_duplicated, $hashtags_added, $hashtags_added, $hashtags_deleted, $refs_added, $refs_deleted >> $DATA_FILE
# Commit Changes to Git with Custom Message
commit_msg=("$YESTERDAY Daily Writing Stats: Words Added: $words_added, Files Added: $files_added")
git commit -m "$commit_msg"
git push origin master
# Future Todo: Improved Message with actual stats
osascript -e 'display notification "Successfully copied writing files and logged stats into git." with title "Yesterday Daily Writing Stats Saved"'
# FUTURE TODO: [Possible] Curl to save stats to a google sheet or another tracking api
# SEE: https://productforums.google.com/forum/#!topic/docs/18vuCI8Me10
# https://gist.github.com/joebuhlig/b0c3cd227c148685f98d/raw/74ee50a30b9b0857f251c5d753573a4ae4a6ecfa/Git%20Writing%20Tracker%20Post-Commit
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment