PySpark in Docker Containers
Updated Jun 22, 2022 - Dockerfile
RELK -- The Research Elastic Stack (Kafka, Beats, Zookeeper, Logstash, ElasticSearch, Kibana, Spark, & Jupyter -- All in Docker)
Builds on a Python 3.6 Alpine base image and adds Java, pandas, NumPy, PySpark, and Spark as runtime dependencies. The image can be used as the container image when running spark-submit on Kubernetes.
Docker Compose environment for big data research and machine learning development
Container-based inner loop development environment for Databricks
Docker Compose setup for PySpark
Docker images for spark on kubernetes
Minimalist install of pyspark on top of Red Hat UBI
Dockerized Environment for developing Geospatial applications in Python using Apache Spark, Apache Sedona and Delta Lake.
A lightweight Docker image for running PySpark and Jupyter Notebook on port 8000
Spark 3.0.1 Docker images.