I have a string with 2 phrases, separated by an upper case word in the same string: c="Text is here. TEST . More text here also" I want to separate both phrase
I have two lists: a list of about 750K "sentences" (long strings) and a list of about 20K "words" that I would like to delete from my 750K sentences. So, I have to l
I have the following PCRE2 regex that works to match and remove timestamp lines in a .webVTT subtitle file (the default for YouTube): ^[0-9].:[0-9].:[0-9].+$ T
I have cells in a Google spreadsheet that contain the quantity of some entity, and I wish to extract only the string that comes after the quantity value (number).
I have a dataset (z) with strings which are very long in z$txt. I also have a dictionary (incd) of keywords that need to be identified. In the column z$inc.term
I tried to extract all consecutive capitalized words in a given string written with no spacing in between. E.g. The University Of Sydney => TheUniversityOfSy
I am trying to make a regex that validates whether all the words have an upper-case first letter. For example, Fruits is allowed but fruits is not. als
I am trying to convert this regex to work in Kibana in a query search and can't seem to get it to work ((\+[2-9]|[2-9]){1}\d{10,}$)|((\+[2-9]){1}\d{8,}$)|((\+{0
I have last names in an XML file that I would like to capture, which are unique. I am going off this other StackOverflow answer to start: Only match unique stri
A follow up to my previous question here. I realize I need to be more specific regarding my regex case to get an answer that works for my case. I am fighting w
I'm trying to match strings that either: contain between 9 and 15 numbers (only numbers); or contain between 6 and 15 numbers+letters (it must contain both, numbe
I want to strip all Python docstrings out of a file using simple search and replace, and the following (extremely) simplistic regex does the job for one line do
I have a sh file, and I need my Python script to find specific strings in it and replace all the text between them: hello_example=( # <- this specific line "bla"
Over the years I have slowly developed a regular expression that validates most email addresses correctly, assuming they don't use an IP address as the server p
I am trying to write a regex to accept only 4 words and, in between the words, accept special characters like period (.), hyphen (-) and underscore (_). Also if I use
Is there an expression for excluding a single character from a character range in the (arguably somewhat peculiar) Regex syntax of Microsoft Word? These don't s
I would like to remove the dots within a word, such that a.b.c.d becomes abcd, but under some conditions: there should be at least 2 dots within the word. For e
I am working on a dataframe df that has thousands of rows of junk data in which the first number is to be extracted despite irregular delimiter: dummy_numbers =
I'm trying to print the variables ccb_3, nome, data, taxa and parcela using the function I defined as "ext_ccb", but when I run the code it returns 3 times (bec
I have the following string: text='2-05-13 15:31:48.253 StockSpy Realtime Stocks Quote[5353:207339] Websocket: symbols: ({change = "0.5250015";changePercent = "