Category "unicode"

Unicode to Arabic Win1256

I have Unicode values in my database that were encoded from Arabic, and I need some source code to convert them back to actual Arabic. I have tried a lot of code but not wo

Properly logging Unicode & UTF-8 exceptions in Python 2

I'm trying to log various exceptions from libraries in Python 2.7. I find that sometimes the exceptions contain a unicode string and sometimes a utf8 bytestring

How to print Unicode character U+1F4A9 'pile of poo' emoji

I am trying to print a Unicode character in Ruby, specifically the pile of poo. It has a Unicode code point of U+1F4A9. But when I try to print "\u1F4A9" to the outp

How do I enter an emoji into a string in Vim?

I just need to enter one emoji into a string and couldn't figure out how. Unicode input is not working. I tried a digraph, but that didn't work either.

How to convert list of bytes (unicode) to Python string?

I have a list of bytes (8-bit bytes; in C/C++ they would form a wchar_t string) that together form a Unicode string, byte by byte. How to convert those value

os.listdir is removing character accent

In Windows File Explorer, create a new txt file and name it Ń.txt (note the accent over the N). Hold Shift and right-click the folder where you created 

ASP.NET MVC @ outputs the wrong Unicode string

<meta http-equiv="Content-type" content="text/html;charset=UTF-8" /> <meta name="description" content="@("Du lịch chữa bệnh ở

(unicode error) 'unicodeescape' codec can't decode bytes in position 2-3: truncated \UXXXXXXXX escape [duplicate]

I'm trying to read a .csv file into Python (Spyder) but I keep getting an error. My code: import csv data = open("C:\Users\miche\Documents\s

How to convert the emoji in a string to URL-encoded form with regex

How do I convert a string with emoji ❤️😬 so that each emoji becomes a URL-encoded string like %EF%B8%8F, as in this question's URL, using regex? function emojitourl(str) { r

Isn't a 2-byte char datatype insufficient to deal with the concept of "characters" in a Unicode string?

Various programming languages use a 2-byte char datatype (not to be confused with C/C++'s char, which is just one byte) out of which strings are constructed. Va

Initialize a String from a range of Characters in Swift

In our code, we found a bug from not writing the alphabet correctly. Instead of "0123456789abcdefghijklmnopqrstuvwxyz", we had "0123456789abcdefghijklmnoqprstuv

Example URIs for unit testing IDN domains

I am building a C# tool that does some minor manipulation on URIs. I am trying to write unit tests to make sure the tool can support IDN domains, e.g. 例

How to add a copyright symbol in reason-react component?

I’m new to reason-react. I’m trying to put a copyright symbol in a reason-react component. I've tried <span >(ReasonReact.stringToElement("&

Python print Unicode string via 'Git Bash' gets 'UnicodeEncodeError'

In test.py I have print('Привет мир'). With cmd it worked as normal: > python test.py ?????? ??? with

Reading Unicode file data with BOM chars in Python

I'm reading a series of source code files using Python and running into a unicode BOM error. Here's my code: bytes = min(32, os.path.getsize(filename)) raw = o

Decode UTF-8 with JavaScript

I have JavaScript in an XHTML web page that is passing UTF-8 encoded strings. It needs to continue to pass the UTF-8 version, as well as decode it. How is it po

JavaScript: how to check if character is RTL?

How can I programmatically check if the browser treats some character as RTL in JavaScript? Maybe creating some transparent DIV and looking at where text is pl

Standard character set for Outlook 2010 .msg file

I need to find out which character set is used when I save an email from Outlook 2010 in non-Unicode format. When saving, you can choose between .msg file and

Regex to match word delimiters in multilingual text

I have a text box where a user can enter text in any language, and I need to split that text into words so that I can pass those words into hunspell spel

Why does QStringLiteral return a garbled string

I'm developing Chinese-language software and embed some of the strings in the source file. To reduce runtime overhead (well, actually this is premature optimization,