This is my dataset: from pyspark.sql import SparkSession, functions as F spark = SparkSession.builder.getOrCreate() df = spark.createDataFrame([('2021-02-07',)
postgresql-9.1
jdedwards
uuencode
websharper
octane
l-systems
java-war
android-bluetooth
telecommunication
vscodevim
try-catch
fbx
known-hosts
tcc
microsoft-cognitive
resolution
uilistcontentconfiguration
serverless-architecture
contextily
turbo-pascal
sign-extension
prom-client
dvi
unicast
themes
docker-push
nginx-location
fastfile
microstack
non-thread-safe