Just a quickie test in Python 3 (using Requests) to see if Google Cloud Vision can be used to effectively OCR a scanned data table and preserve its structure, in the way that products such as ABBYY FineReader can OCR an image and provide Excel-ready output.
The short answer: No. While Cloud Vision provides bounding polygon coordinates in its output, it doesn't provide it at the word or region level, which would be needed to then calculate the data delimiters.
On the other hand, the OCR quality is pretty good, if you just need to identify text anywhere in an image, without regards to its physical coordinates. I've included two examples:
####### 1. A low-resolution photo of road signs
#!/bin/sh | |
# | |
# !!! IMPORTANT !!! | |
# As of 2017-12-14, pacaur is unmaintained (https://bbs.archlinux.org/viewtopic.php?pid=1755144#p1755144) | |
# For alternatives see the arch wiki: https://wiki.archlinux.org/index.php/AUR_helpers#Active | |
# pacaur seems to get occasional updates to fix breaking changes due to pacman updates though. | |
# | |
# If you are new to arch, I encourage you to at least read and understand what | |
# this script does befor blindley running it. | |
# That's why I didn't make a one-liner out of it so you have an easier time |
;;; emacs-zotero-bib-fetch.el --- Manage Zotero collections from emacs | |
;; | |
;; Filename: emacs-zotero-bib-fetch.el | |
;; Author: Anders Johansson, based on zotelo by Spinu Vitalie | |
;; Maintainer: Anders Johansson | |
;; Copyright (C) 2011-2014, Anders Johansson and Spinu Vitalie | |
;; Created: 1 Jul 2014 | |
;; Version: 1.2 | |
;; URL: https://github.com/andersjohansson/emacs-zotero-bib-fetch | |
;; Keywords: zotero, emacs, reftex, bibtex, bibliography manager |
An example of using the Watson Speech to Text API to translate a podcast from ProPublica: How a Reporter Pierced the Hype Behind Theranos
This is just a simpler demo of the same technique I demonstrate to make automated video supercuts in this repo: https://github.com/dannguyen/watson-word-watcher
The transcription takes just a few minutes (less if you parallelize the requests to IBM) and is free...but it isn't perfect by any means. It doesn't fare super well on proper nouns:
- Charles Ornstein's last name is transcribed as Orenstein
- John Carreyrou's last name becomes John Kerry Roo
<?xml version="1.0" encoding="UTF-8"?> | |
<!DOCTYPE plist PUBLIC "-//Apple//DTD PLIST 1.0//EN" "http://www.apple.com/DTDs/PropertyList-1.0.dtd"> | |
<plist version="1.0"> | |
<dict> | |
<key>Ansi 0 Color</key> | |
<dict> | |
<key>Alpha Component</key> | |
<real>1</real> | |
<key>Blue Component</key> | |
<real>0.29019609093666077</real> |
I wanted to get a VM configured quickly for building Craft sites, this is what I did:
https://puphpet.com/#deploy-target
I want a local VM working with Vagrant + VirtualBox.