pyspark.sql.DataFrame.head#
- DataFrame.head(n=None)[source]#
 Returns the first
nrows.New in version 1.3.0.
Changed in version 3.4.0: Supports Spark Connect.
- Parameters
 - nint, optional
 default 1. Number of rows to return.
- Returns
 - If n is supplied, return a list of 
Rowof length n - or less if the DataFrame has fewer elements.
 - If n is missing, return a single Row.
 
- If n is supplied, return a list of 
 
Notes
This method should only be used if the resulting array is expected to be small, as all the data is loaded into the driver’s memory.
Examples
>>> df = spark.createDataFrame([ ... (2, "Alice"), (5, "Bob")], schema=["age", "name"]) >>> df.head() Row(age=2, name='Alice') >>> df.head(1) [Row(age=2, name='Alice')] >>> df.head(0) []