    
  
  
    
  | ''' | |
| Created on Dec 19, 2013 | |
| A script to convert TMX files into parallel corpora for machine | |
| translation (e.g. Moses: http://www.statmt.org/moses/) training. | |
| Pass in either paths to TMX files, or directories containing TMX files. | |
| The script will recursively traverse directories and process all TMXs. | |
| To perform tokenization or to filter the output, use the convert() method | 
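
The traversal and extraction described in the docstring above could be sketched roughly as follows. This is a minimal illustration, not the script's actual `convert()` implementation: the helper names `find_tmx_files` and `extract_pairs` are hypothetical, and the sketch assumes each `<tuv>` carries an `xml:lang` (or legacy `lang`) attribute that matches the requested language code exactly, ignoring case.

```python
import os
import xml.etree.ElementTree as ET

# ElementTree exposes the xml:lang attribute under the XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def find_tmx_files(paths):
    """Yield TMX file paths from a mix of files and directories (hypothetical helper)."""
    for path in paths:
        if os.path.isdir(path):
            # Walk the directory tree recursively and pick up every .tmx file.
            for root, _dirs, files in os.walk(path):
                for name in sorted(files):
                    if name.lower().endswith(".tmx"):
                        yield os.path.join(root, name)
        elif path.lower().endswith(".tmx"):
            yield path


def extract_pairs(tmx_path, src_lang, tgt_lang):
    """Return (source, target) segment pairs from one TMX file (hypothetical helper).

    Assumes src_lang and tgt_lang are lowercase codes that match the
    xml:lang values in the file exactly (no region fallback).
    """
    pairs = []
    for tu in ET.parse(tmx_path).getroot().iter("tu"):
        segs = {}
        for tuv in tu.iter("tuv"):
            lang = tuv.get(XML_LANG) or tuv.get("lang") or ""
            seg = tuv.find("seg")
            if seg is not None:
                # itertext() flattens inline markup inside <seg>.
                segs[lang.lower()] = "".join(seg.itertext()).strip()
        if src_lang in segs and tgt_lang in segs:
            pairs.append((segs[src_lang], segs[tgt_lang]))
    return pairs
```

Writing the source and target halves of each pair to two line-aligned text files would then give the parallel corpus layout that Moses-style training expects.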
  
    
    
  
  
    
  | # The initial idea came from this code: https://gist.github.com/duarteguilherme/25a80afd502035c55b45ed85dc8977e0 | |
| import requests | |
| import json | |
| import csv | |
| headers = { | |
| 'Host': 'www.cnj.jus.br', | |
| 'Connection': 'keep-alive', | |
| 'Content-Length': '213', | |
| 'Accept': 'application/json, text/plain, */*', | |
| 'Origin': 'http://www.cnj.jus.br', |