spark dataframe
时间: 2023-10-13 18:07:21 浏览: 147
sparkOptics:Spark DataFrame的光学元件
Spark DataFrame is a distributed collection of data organized into named columns. It is an abstraction layer over the lower-level RDD (Resilient Distributed Dataset) API and provides a more convenient programming interface. Spark DataFrame supports various data sources such as CSV, JSON, Parquet, Avro, and JDBC, and can perform various operations like filtering, aggregating, and joining data. It is also optimized for handling large-scale datasets and can be used for both batch and stream processing.
阅读全文