The EMR File System (EMRFS) is an implementation of HDFS that all Amazon EMR clusters use for reading and writing regular files from Amazon EMR directly to Amazon S3.
Coming from HDFS it is very easy to implement EMRFS.
You just need to pass URI("s3://<bucket-name>")
object while getting filesystem object.
package com.joe;