How to run Newspaper (the Python 2.7 version) in an Amazon Lambda function:
- Start a new EC2 instance with the Amazon Linux AMI
sudo yum install gcc gcc-c++ libjpeg-devel zlib-devel libevent-devel libxml2-devel libxslt-devel libpng-devel
sudo yum install python27-devel python27-pip
virtualenv env
source env/bin/activate
sudo /usr/bin/easy_install lxml
pip install newspaper
nano env/local/lib/python2.7/site-packages/newspaper/settings.py
- change the DATA_DIRECTORY variable value to '/tmp/.newspaper_scraper'
zip -9 bundle.zip lambda_function.py
cd $VIRTUAL_ENV/lib/python2.7/site-packages
zip -r9 ~/bundle.zip *
cd $VIRTUAL_ENV/lib64/python2.7/site-packages
zip -r9 ~/bundle.zip *
- Upload the bundle.zip file to your Lambda function
  - This assumes a default Handler set to lambda_function.lambda_handler
- Delete your EC2 instance
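
The steps above zip a lambda_function.py but do not show its contents. As a minimal sketch only, assuming the function is invoked with an event of the form {"url": "..."} (that event shape and the response format are illustrative assumptions, not part of the original instructions), a handler matching lambda_function.lambda_handler might look like this:

```python
import json


def lambda_handler(event, context):
    """Hypothetical handler: download and parse the article at event["url"]."""
    # The "url" key is an assumed event shape, chosen for illustration.
    url = (event or {}).get("url")
    if not url:
        return {"statusCode": 400,
                "body": json.dumps({"error": "missing 'url' in event"})}

    # Imported here rather than at module level so the file can still be
    # loaded for a quick local check on a machine without newspaper installed.
    from newspaper import Article

    article = Article(url)
    article.download()  # fetch the page HTML
    article.parse()     # extract title, text, and other fields

    return {"statusCode": 200,
            "body": json.dumps({"title": article.title,
                                "text": article.text})}
```

Newspaper writes its working files under the directory configured in settings.py, which is why the earlier step points DATA_DIRECTORY at /tmp, the writable location in the Lambda environment.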
I have a few questions. Why was a new EC2 instance created? Can this not be done on an Ubuntu machine or a Mac?
Do you have any suggestions for the newspaper3k version?