UDF using a Python function in PySpark
Create and use a UDF to apply custom Python functions while processing DataFrames
2024-10-03   1 min   workflow 2

Remove duplicate rows from a DataFrame
Remove duplicate rows using distinct() and dropDuplicates()
2024-09-18   1 min   guide 12 cleaning 1 workflow 2