A.Juneja gr8Adakron

## Sublime-Shortcuts.md

      
gr8Adakron / Sublime-Shortcuts.md (last active November 5, 2021 09:03)

Sublime text shortcuts

### Insert and Select

Cmd+D - Select a word
Cmd+Shift+Enter - Insert a line before the current line
Cmd+Enter - Insert a line after the current line
Cmd+L - Select the current line

### Delete

Cmd+Shift+K - Delete a line
Cmd+K, Cmd+K - Delete from cursor to end of line
Cmd+K, Cmd+Backspace - Delete from cursor to start of line


## multiprocess_JSONtoCSV.py
# Author: gr8_adakron.
# Python: 3.6+ (required for f-strings)

from subprocess import PIPE, Popen
from multiprocessing import Pool

import multiprocessing as mp
import pandas as pd

import random

## csv2json-berkeley.pl

#Author : gr8_Adakron.

use Term::ANSIColor;
use Time::HiRes qw(time);
use strict;
use warnings;
use Carp;
use POSIX ":sys_wait_h";
use Data::Dumper;

## README.md

      
gr8Adakron / README.md (created August 11, 2017 06:29, forked from dannguyen/README.md)

Using Python 3.x and Google Cloud Vision API to OCR scanned documents to extract structured data

### Using Python 3 + Google Cloud Vision API's OCR to extract text from photos and scanned documents

Just a quick test in Python 3 (using Requests) to see whether Google Cloud Vision can effectively OCR a scanned data table and preserve its structure, the way products such as ABBYY FineReader can OCR an image and produce Excel-ready output.

The short answer: No. While Cloud Vision provides bounding polygon coordinates in its output, it doesn't provide them at the word or region level, which would be needed to calculate the data delimiters.

On the other hand, the OCR quality is pretty good if you just need to identify text anywhere in an image, without regard to its physical coordinates. I've included two examples:

#### 1. A low-resolution photo of road signs
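
A minimal sketch of how an image like this might be sent to Cloud Vision with Requests. It assumes the v1 `images:annotate` REST endpoint, an API key passed as the `key` query parameter, and the `TEXT_DETECTION` feature; the helper names (`build_ocr_request`, `extract_text`, `ocr_image`) are hypothetical and not taken from the original gist.

```python
import base64

# Assumed REST endpoint for Cloud Vision v1; verify against the current API docs.
ENDPOINT = "https://vision.googleapis.com/v1/images:annotate"


def build_ocr_request(image_bytes):
    """Build the JSON body for a single-image TEXT_DETECTION request."""
    return {
        "requests": [
            {
                # The image is sent inline as base64-encoded bytes.
                "image": {"content": base64.b64encode(image_bytes).decode("ascii")},
                "features": [{"type": "TEXT_DETECTION"}],
            }
        ]
    }


def extract_text(response_json):
    """Return the full detected text, or an empty string if none was found."""
    annotations = response_json["responses"][0].get("textAnnotations", [])
    # The first annotation holds the entire detected text block.
    return annotations[0]["description"] if annotations else ""


def ocr_image(path, api_key):
    """Send one image file to Cloud Vision and return the detected text."""
    # Third-party dependency; imported here so the helpers above work without it.
    import requests

    with open(path, "rb") as f:
        body = build_ocr_request(f.read())
    resp = requests.post(ENDPOINT, params={"key": api_key}, json=body)
    resp.raise_for_status()
    return extract_text(resp.json())
```

With a valid key, a call such as `ocr_image("road-signs.jpg", "YOUR_API_KEY")` (file name hypothetical) would return the detected text as one string, which matches the "text anywhere in the image" use case rather than table reconstruction.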

  
## csvtojson.pl
#!/usr/bin/perl
#Author: gr8_Adakron.
#--------------------- Perl Packages --------------------
use strict;
use warnings;
use JSON;
use Text::CSV;

#-------------------- Global Variables --------------------
my $flag_header = 1;

## zamzar_conversion.py
import requests
from requests.auth import HTTPBasicAuth
#--------------------------------------------------------------------------#
api_key = 'Put_Your_API_KEY'  # your API key from developer.zamzar.com (sign up to get one)
source_file = "tmp/armash.pdf"  # path of the file to convert
target_file = "results/armash.txt"  # path and name of the converted output
target_format = "txt"  # format to convert to
#-------------------------------------------------------------------------#


## compareList.py
import pandas as pd
List1=[10,11,12,15,16,18,19]
List2=[11,15,16,19,13]
List3=[11,12,15,19]

d = {'List1' : pd.Series(List1),'List2' : pd.Series(List2),'List3': pd.Series(List3)}

df = pd.DataFrame(d)

print(df)