from ftplib import FTP
from datetime import datetime
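# Record the start time
start = datetime.now()

# Connect and log in (placeholders: substitute your own host and credentials)
ftp = FTP('your-ftp-domain-or-ip')
ftp.login('your-username', 'your-password')

# List the names of all entries in the current remote directory
files = ftp.nlst()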
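
The snippet above stops once the file names are listed. As a minimal sketch of a possible next step, the names could be passed to a helper that fetches each file into memory. The helper name `download_all` is hypothetical and not part of the original snippet. The sketch assumes that entries which cannot be retrieved (for example directories, which `nlst()` may also return) should simply be skipped.

```python
import io
from ftplib import error_perm


def download_all(ftp, names):
    """Fetch each named remote file into memory and return {name: bytes}.

    `ftp` is expected to be a connected, logged-in ftplib.FTP instance
    (or any object with a compatible retrbinary method).
    """
    contents = {}
    for name in names:
        buffer = io.BytesIO()
        try:
            # retrbinary calls the callback once for every chunk received
            ftp.retrbinary(f"RETR {name}", buffer.write)
        except error_perm:
            # Assumption: permission errors (e.g. the entry is a directory)
            # are skipped rather than treated as fatal
            continue
        contents[name] = buffer.getvalue()
    return contents
```

With the connection from the snippet above, this would be called as `data = download_all(ftp, files)`. Holding every file in memory only suits small files; for large ones, writing each chunk to a local file opened in binary mode would be the safer choice.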
import os
import pyarrow.parquet as pq
#
# Warning!!!
# Suffers from the same problem as the parquet-tools merge function
#
#parquet-tools merge:
#Merges multiple Parquet files into one. The command doesn't merge row groups,
#just places one after the other. When used to merge many small files, the
"""
GET_SAS_AS_DASK.PY
2019-05-02
kingfischer16
Functionality to read SAS data from a SAS server (or locally) and return
dask.dataframe.
General idea: Using SASPY, build a list of pandas.DataFrames that are blocks
called via a SAS session. These blocks then make up the dask.DataFrame. Helper