Stephen Bridgwater wizzdm

## compress_pdf.md

      
Created March 22, 2022 08:15, forked from ahmed-musallam/compress_pdf.md

How to compress PDF using ghostscript

As a developer, it bothers me when someone sends me a PDF file that is large relative to its number of pages. Recently, I received a 12MB scanned document for just one letter-sized page... so I got to googling, like I usually do, and found ghostscript!
To learn more about ghostscript (gs): https://www.ghostscript.com/
What we are interested in is the gs command line tool, which provides many options for manipulating PDFs; here we only care about compressing those large PDFs into small yet legible documents.
Credit goes to this answer on the askubuntu forum: https://askubuntu.com/questions/3382/reduce-filesize-of-a-scanned-pdf/3387#3387?newreg=bceddef8bc334e5b88bbfd17a6e7c4f9
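
As a rough illustration of the kind of gs call this is about, here is a minimal Python sketch that builds a `pdfwrite` command line and runs it when gs is installed. The helper names (`build_gs_command`, `compress_pdf`), the file names, and the choice of the `/ebook` preset are assumptions made for the example and are not taken from the original gist or the linked answer.

```python
import shutil
import subprocess

# Quality presets accepted by Ghostscript's -dPDFSETTINGS option.
PRESETS = ("screen", "ebook", "printer", "prepress", "default")


def build_gs_command(src, dst, preset="ebook"):
    """Return the argument list for a gs run that rewrites src into dst."""
    if preset not in PRESETS:
        raise ValueError(f"unknown preset: {preset}")
    return [
        "gs",
        "-sDEVICE=pdfwrite",          # write a new PDF
        "-dCompatibilityLevel=1.4",
        f"-dPDFSETTINGS=/{preset}",   # lower presets downsample images harder
        "-dNOPAUSE",
        "-dQUIET",
        "-dBATCH",
        f"-sOutputFile={dst}",
        src,
    ]


def compress_pdf(src, dst, preset="ebook"):
    """Run gs on src if it is installed; raise RuntimeError otherwise."""
    if shutil.which("gs") is None:
        raise RuntimeError("ghostscript (gs) not found on PATH")
    subprocess.run(build_gs_command(src, dst, preset), check=True)


# Hypothetical file names, only used to show the resulting command line.
cmd = build_gs_command("input.pdf", "output.pdf")
print(" ".join(cmd))
```

Calling `compress_pdf("input.pdf", "output.pdf", preset="screen")` would trade more image quality for a smaller file; how legible the result stays depends on the scan.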

  
## javascript_tutorial.html


<!DOCTYPE html>

<html>

<head>
	<script type="text/javascript" src="test.js"></script>
</head>

## web2csv.py
#!/usr/bin/env python
#
# Source: http://blog.webhose.io/2015/08/16/dead-simple-for-devs-python-crawler-script-for-extracting-structured-data-from-any-almost-website-into-csv/
#
# Note: this is Python 2 code; thread, Queue, urllib2 and urlparse were renamed in Python 3.

import sys, thread, Queue, re, urllib2, urlparse, time, csv
### Set the site you want to crawl & the patterns of the fields you want to extract ###
siteToCrawl = "http://www.amazon.com/"
fields = {}
# Each field maps a CSV column name to a regex whose first group is the value.
fields["Title"] = '<title>(.*?)</title>'
fields["Rating"] = r'title="(\S+) out of 5 stars"'
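
The preview stops before the crawler uses `fields`, so the following is only a sketch of how such name-to-regex patterns might be applied to one page. It assumes the rating pattern is meant to be `\S+` (any run of non-whitespace), and both `extract_fields` and the sample HTML string are hypothetical, written for Python 3.

```python
import re

# Column name -> regex; the first capture group holds the extracted value.
fields = {
    "Title": r'<title>(.*?)</title>',
    "Rating": r'title="(\S+) out of 5 stars"',
}


def extract_fields(html, patterns):
    """Return one CSV-style row: an empty string when a pattern does not match."""
    row = {}
    for name, pattern in patterns.items():
        match = re.search(pattern, html, re.DOTALL)
        row[name] = match.group(1).strip() if match else ""
    return row


# Made-up page content, used only to exercise the patterns.
sample_html = (
    '<html><head><title>Example Product</title></head>'
    '<body><span title="4.5 out of 5 stars"></span></body></html>'
)
row = extract_fields(sample_html, fields)
print(row)
```

Returning an empty string for a missing field keeps every row the same width, which makes the rows easy to hand to `csv.DictWriter`.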

## tryjs_dom_getelementbyid.html
<!DOCTYPE html>
<html>
<body>

<p id="intro">Hello World!</p>

<p>This example demonstrates the <b>getElementById</b> method!</p>

<p id="demo"></p>

## keybase.md

      
Last active November 7, 2017 16:21

Keybase proof

I hereby claim:

I am wizzdm on github.
I am wizzdm (https://keybase.io/wizzdm) on keybase.
I have a public key ASB9aKy5xpEHZVaqZ1sWLdabbhnNKIFEWYP1avX9AIkHpAo

To claim this, I am signing this object:

  
## HotOnGitHub.js
/**
* Fetching data from githubarchive.org into BigQuery to see what is cool and hot on github
* Author: Ido Green
* Date: 2014-April-01
*
* A post on the subject: http://wp.me/pB1lQ-19i
* More on BQ: https://developers.google.com/bigquery/
*/


## get_n_results.py
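import re

import bs4
import requests


def get_n_results_dumb(q):
    r = requests.get('http://www.google.com/search',
                     params={'q': q,
                             "tbs": "li:1"})
    r.raise_for_status()
    # Name the parser explicitly so bs4 does not have to guess one.
    soup = bs4.BeautifulSoup(r.text, 'html.parser')
    stats = soup.find('div', {'id': 'resultStats'})
    # find() returns None when the div is missing, so guard before reading .text.
    s = stats.text if stats else ''
    if not s:
        return 0
    m = re.search(r'([0-9,]+)', s)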
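
The preview is cut off right after the regex search, so what the function does with `m` is not shown. One plausible continuation, offered here only as an assumption, is to strip the thousands separators and return an integer. The helper name `parse_result_count` and the sample strings are hypothetical.

```python
import re


def parse_result_count(stats_text):
    """Pull the first comma-grouped number out of a result-stats string."""
    m = re.search(r'([0-9,]+)', stats_text)
    if not m:
        return 0
    # Drop thousands separators before converting, e.g. "1,230,000" -> 1230000.
    digits = m.group(1).replace(',', '')
    return int(digits) if digits else 0


count = parse_result_count("About 1,230,000 results (0.45 seconds)")
print(count)
```

Keeping the parsing separate from the HTTP request makes it testable without hitting the network.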

