Anatoliy Plastinin antlypls

## README.md

      
              2 files
            
          
              0 forks
            
          
                0 comments
              
            
              0 stars
            
          
                antlypls
                / README.md
            
            
              Created
              May 2, 2016 21:38
                — forked from dannguyen/README.md
            
              
                Using Python 3.x and Google Cloud Vision API to OCR scanned documents to extract structured data
              
          
    Using Python 3 + Google Cloud Vision API's OCR to extract text from photos and scanned documents

Just a quickie test in Python 3 (using Requests) to see if Google Cloud Vision can be used to effectively OCR a scanned data table and preserve its structure, in the way that products such as ABBYY FineReader can OCR an image and provide Excel-ready output.
The short answer: No. While Cloud Vision provides bounding polygon coordinates in its output, it doesn't provide it at the word or region level, which would be needed to then calculate the data delimiters.
On the other hand, the OCR quality is pretty good, if you just need to identify text anywhere in an image, without regards to its physical coordinates. I've included two examples:
####### 1. A low-resolution photo of road signs

  
## s3.sh
# You don't need Fog in Ruby or some other library to upload to S3 -- shell works perfectly fine
# This is how I upload my new Sol Trader builds (http://soltrader.net)
# Based on a modified script from here: http://tmont.com/blargh/2014/1/uploading-to-s3-in-bash

S3KEY="my aws key"
S3SECRET="my aws secret" # pass these in

function putS3
{
  path=$1

## onchange.sh
#!/bin/bash
#
# Watch current directory (recursively) for file changes, and execute
# a command when a file or directory is created, modified or deleted.
#
# Written by: Senko Rasic <senko.rasic@dobarkod.hr>
#
# Requires Linux, bash and inotifywait (from inotify-tools package).
#
# To avoid executing the command multiple times when a sequence of

## dynamic_method_spec.rb
require 'spec/support/grep_matcher'

describe do
  disallow_presence_of pattern: "send(.*#",
    location: "app/",
    description: "Do not use dynamic method invocations",
    failure: "Please change dynamic method call to something more sane."
end

## mobile.css.scss
@mixin mobile_bg($file){
  background-image: image-url('mobile/standard/#{$file}');

  @media screen and (-webkit-min-device-pixel-ratio: 2) {
    background-image: image-url('mobile/retina/#{$file}');
    background-size: image-width('mobile/standard/#{$file}') image-height('mobile/standard/#{$file}');
  }
}

// Usage

## Guardfile
guard 'process', :name => 'shrink-images', :command => 'ruby resize-mobile-images.rb mobileretina' do
  watch /^images\/mobileretina\/.+/
end

## Locales.yml
# config/locales/en.yml

en:
  exception:
    show:
      not_found:
        title: "Not Found"
        description: "The page you were looking for does not exists."
      internal_server_error:
        title: "Internal Server Error"

## avatar_uploader.rb
# app/uploaders/avatar_uploader.rb

  process :fix_exif_rotation
  process :strip
  process :resize_to_fill => [1024, 768]
  process :quality => 90 # Percentage from 0 - 100

## call_template.rb
# create the template
template = PageOfflineTemplate.new
template.quote = quote
template.pages = quote.build_pages

# Here I render a template with layout to a string then a PDF
pdf = PDFKit.new template.render_to_string(:template=>"quotes/review.html.haml")


## passwords_controller.rb
# app/controllers/users/password_controller.rb

class Users::PasswordsController < Devise::PasswordsController
  def resource_params
    params.require(:user).permit(:email, :password, :password_confirmation)
  end
  private :resource_params
end
	# You don't need Fog in Ruby or some other library to upload to S3 -- shell works perfectly fine
	# This is how I upload my new Sol Trader builds (http://soltrader.net)
	# Based on a modified script from here: http://tmont.com/blargh/2014/1/uploading-to-s3-in-bash

	S3KEY="my aws key"
	S3SECRET="my aws secret" # pass these in

	function putS3
	{
	path=$1
	#!/bin/bash
	#
	# Watch current directory (recursively) for file changes, and execute
	# a command when a file or directory is created, modified or deleted.
	#
	# Written by: Senko Rasic <senko.rasic@dobarkod.hr>
	#
	# Requires Linux, bash and inotifywait (from inotify-tools package).
	#
	# To avoid executing the command multiple times when a sequence of
	require 'spec/support/grep_matcher'

	describe do
	disallow_presence_of pattern: "send(.*#",
	location: "app/",
	description: "Do not use dynamic method invocations",
	failure: "Please change dynamic method call to something more sane."
	end
	@mixin mobile_bg($file){
	background-image: image-url('mobile/standard/#{$file}');

	@media screen and (-webkit-min-device-pixel-ratio: 2) {
	background-image: image-url('mobile/retina/#{$file}');
	background-size: image-width('mobile/standard/#{$file}') image-height('mobile/standard/#{$file}');
	}
	}

	// Usage
	guard 'process', :name => 'shrink-images', :command => 'ruby resize-mobile-images.rb mobileretina' do
	watch /^images\/mobileretina\/.+/
	end
	# config/locales/en.yml

	en:
	exception:
	show:
	not_found:
	title: "Not Found"
	description: "The page you were looking for does not exists."
	internal_server_error:
	title: "Internal Server Error"
	# app/uploaders/avatar_uploader.rb

	process :fix_exif_rotation
	process :strip
	process :resize_to_fill => [1024, 768]
	process :quality => 90 # Percentage from 0 - 100
	# create the template
	template = PageOfflineTemplate.new
	template.quote = quote
	template.pages = quote.build_pages

	# Here I render a template with layout to a string then a PDF
	pdf = PDFKit.new template.render_to_string(:template=>"quotes/review.html.haml")
	# app/controllers/users/password_controller.rb

	class Users::PasswordsController < Devise::PasswordsController
	def resource_params
	params.require(:user).permit(:email, :password, :password_confirmation)
	end
	private :resource_params
	end