There is a file called path.py https://github.com/metabrainz/listenbrainz-labs/blob/master/listenbrainz_spark/path.py that contain path to directories which we need in HDFS. eg path: DATAFRAME_DIR = os.path.join('/', 'recommendation', 'dataframe')
one use of path.py is here : https://github.com/metabrainz/listenbrainz-labs/blob/master/manage.py#L57
Now, there is another file called create_dataframe.py which need path info, here: https://github.com/metabrainz/listenbrainz-labs/blob/master/listenbrainz_spark/recommendations/create_dataframes.py#L86