This page is for gathering requirements towards a definition of a Minimum Viable Product (MVP) for a lightweight PNDA.
- No Hortonworks or Cloudera "Hadoop Provider"
- HDFS with expandable data nodes
- Small footprint
- Kubernetes orchestratable
- Zookeeper + Apache Kafka (version 0.11 1.x 2.x?)
- Apache Spark 2.x
- Jupyter Notebook 5.x with jupyter-lab support and python3 and scala kernels (beakerx extension?).