First of all. ML has two quite different activity domains:
- Running something on many repositories.
- Running something on a single repository
Depending on the size of (2), it makes or does not make sense to launch Spark. For example, consider the topic model application scenario: