This is my dataset:

    from pyspark.sql import SparkSession, functions as F
    spark = SparkSession.builder.getOrCreate()
    df = spark.createDataFrame([('2021-02-07',)