This is my dataset: from pyspark.sql import SparkSession, functions as F spark = SparkSession.builder.getOrCreate() df = spark.createDataFrame([('2021-02-07',)
dtsearch
apple-push-notifications
dynamic
cats-effect
google-cloud-instances
jenkins-mailer-plugin
onflow-cadence
flooding
http-status-code-504
spinnaker-cam
typeloadexception
objectbrowser
virtual-column
vue-transitions
wss4j
shopping-cart
boxplot
conditional-breakpoint
code-signing-entitlements
contention
typetraits
arguments
winsockets
troff
.mpp
nsurlsessiontask
aws-cloudshell
nested-if
github-cli
asdoc