This is my dataset: from pyspark.sql import SparkSession, functions as F spark = SparkSession.builder.getOrCreate() df = spark.createDataFrame([('2021-02-07',)
common-project-system
flasgger
jquery-selectbox
dv360
kdc
2-way-object-databinding
suppress-warnings
file-extension
mrjob
regex-replace
postman
derivative
rvg
getstaticprops
jest-mock-axios
cakephp-2.1
selenium4
node-rest-client
svgpathtools
tcpdf
sgml
info-plist
truncate
chartjs-plugin-zoom
vedo
weakhashmap
ddd-debugger
cametallayer
rete
commodore