Remove non-UTF-8 characters from a varchar column
Does anyone know how to remove non-UTF-8 characters from a string? I could not find any relevant information on the web or in the forum.
I need to remove (or replace) the question mark in a diamond. I found that character code 56480 corresponds to the value E'\xa0' and was able to remove it, but I would prefer a single piece of code that removes all instances of these non-printable characters.
Any ideas?
Answers
Hi,
You should remove non-UTF-8 characters using an ETL tool or process. However, if the data is already in Vertica, you can use a regular expression to remove all non-UTF-8 characters...
Example:
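As a rough illustration of the ETL-side option only (this is not the Vertica regular expression mentioned above), here is a minimal sketch that assumes the raw bytes pass through a Python step before loading. The function names `strip_invalid_utf8` and `replace_invalid_utf8` are hypothetical helpers made up for this sketch; they rely only on the standard `bytes.decode` error handlers.

```python
def strip_invalid_utf8(raw: bytes) -> str:
    """Decode raw bytes as UTF-8, dropping any bytes that are not valid UTF-8.

    Hypothetical helper: assumes the source field is available as bytes
    before it is loaded into the database.
    """
    return raw.decode("utf-8", errors="ignore")


def replace_invalid_utf8(raw: bytes, placeholder: str = "?") -> str:
    """Decode raw bytes as UTF-8, substituting a placeholder for invalid bytes.

    errors="replace" turns each undecodable byte sequence into U+FFFD (the
    "question mark in a diamond"); that marker is then swapped for the
    placeholder. Note: a genuine U+FFFD already present in the input is
    replaced as well.
    """
    return raw.decode("utf-8", errors="replace").replace("\ufffd", placeholder)
```

For instance, `strip_invalid_utf8(b"abc\xa0def")` returns `"abcdef"`, because a lone `0xA0` byte is not a valid UTF-8 sequence, while valid multi-byte characters pass through unchanged.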