UDF using python function in PySpark
Create and use UDF to apply custom python functions while processing dataframes
2024-10-03   1 min   workflow (2)

Remove duplicate rows from dataframe
Remove duplicate rows using distinct() and dropDuplicates()
2024-09-18   1 min   guide (12) cleaning (1) workflow (2)