Skip to content

Instantly share code, notes, and snippets.

Avatar
💭
Coding...

Erreur32 Erreur32

💭
Coding...
View GitHub Profile
@Erreur32
Erreur32 / git-commit-log-stats.md
Created Jul 19, 2021 — forked from eyecatchup/git-commit-log-stats.md
Some commands to get git commit log statistics for a repository on the command line.
View git-commit-log-stats.md

git commit stats

Commands to get commit statistics for a Git repository from the command line -
using git log, git shortlog and friends.




View php-webscraping.md

Have you ever wanted to get a specific data from another website but there's no API available for it? That's where Web Scraping comes in, if the data is not made available by the website we can just scrape it from the website itself.

But before we dive in let us first define what web scraping is. According to Wikipedia:

{% blockquote %} Web scraping (web harvesting or web data extraction) is a computer software technique of extracting information from websites. Usually, such software programs simulate human exploration of the World Wide Web by either implementing low-level Hypertext Transfer Protocol (HTTP), or embedding a fully-fledged web browser, such as Internet Explorer or Mozilla Firefox. {% endblockquote %}

View Apache Better Blocking with common rules.txt
Following on from other Gists I have posted, this one shows a neat way of using Includes to centralise general blocking rules for Bad Bots, creepy crawlers and irritating IPs
see the full post at http://www.blue-bag.com/blog/apache-better-blocking-common-rules
@Erreur32
Erreur32 / httpd.conf_spiders
Created Jul 3, 2020 — forked from gplv2/httpd.conf_spiders
Apache bot control system, filter out spiders good and bad crawlers/ webspiders when they hit your server hard, like googlebot , bingbot. Block all them for specific places marked in the robots.txt to not visit (yet they do sometimes).
View httpd.conf_spiders
# To relieve servers
##Imagine a robots.txt file like this (Google understands this format):
#User-agent: *
#Disallow: /detailed
#Disallow: /?action=detailed
#Disallow: /*/detailed
#Crawl-delay: 20
##
View updater-netdata-git.sh
#!/bin/bash
#
# Script Updater for netdata
#
# - Depencies: Wring package (NPM)
#
# By Erreur32 - 2018
#
@Erreur32
Erreur32 / updater-netdata.sh
Last active Dec 21, 2018
updater-netdata.sh
View updater-netdata.sh
#!/bin/bash
#
# Script Updater for netdata
#
# /!\ NEED Depencies:
# Wring package (NPM)
# Install:
# npm install --global wring
#
# By Erreur32 - 2018 December
View Updater-plex.sh
#!/bin/bash
#####
#
# This Script will update Plex Media Server to the latest version for Ubuntu
#
# To automatically check & update plex, run "crontab -e" and add the following lines
#
# # Check for Plex Media Server Updates every day @6:00 am
# 0 6 * * * /path/you/want/update-plexmediaserver.sh
View index.html
<h1 class="alpha ">
Echo'system'
</h1>
<img class="mars" src="https://www.nasa.gov/sites/default/files/thumbnails/image/christmas2015fullmoon.jpg" alt="" />