Data Engineer

7 posts

Setup Minikube

k8s

bin/spark-submit \ master k8s://https://192.168.99.100:8443 \ deploy-mode cluster \ name spark-pi \ class org.apache.spark.examples.SparkPi \ conf...

Spark SQL

spark

```txt master MASTERURL --> 运行模式 例:spark://host:port, mesos://host:port, yarn, or local.

Spark Optimization

spark

PROCESSLOCAL data is in the same JVM as the running code. This is the best locality possible NODELOCAL data is on the same node. Examples might be in...

Airflow

airflow

import airflow from airflow.models import DAG from airflow.operators.pythonoperator import PythonOperator

Light