Skip to main content
Solved

Formula to identify diacritics

  • June 14, 2024
  • 2 replies
  • 37 views

Forum|alt.badge.img+4

Hello,

I am working on a base that includes artists' names and a lot of them have diacritics in their first and last name. Is there a formula that I can use that could identify if there are diacritics used? It can be a simple True or False formula that I can then use a filter to isolate everyone with diacritics in their name. 

For First and Last name I am used simple Single line columns.

A list of names is important because it is used for a remapping feature on our website, so the end result for airtable needs to be a filtered view of names. I appreciate any help you can provide with this. 

Best answer by TheTimeSavingCo

Hmm diacritics as in stuff like "á"?  If so, perhaps you could try:

 

IF( ISERROR(REGEX_EXTRACT(Name, '[^\\x00-\\x7F]')), "No ", "Yes" )

 

My regex isn't that great though so hopefully someone else might have a better solution!

 

2 replies

TheTimeSavingCo
Forum|alt.badge.img+31

Hmm diacritics as in stuff like "á"?  If so, perhaps you could try:

 

IF( ISERROR(REGEX_EXTRACT(Name, '[^\\x00-\\x7F]')), "No ", "Yes" )

 

My regex isn't that great though so hopefully someone else might have a better solution!

 

j-hugg-ins
Forum|alt.badge.img+9
  • Participating Frequently
  • June 15, 2024

@TheTimeSavingCo 's answer works nicely for the full range of ascii characters. Not that it matters necessarily but I would have used a straight REGEX_MATCH which returns true if a diacritic is found in the provided string {Artist} and false if not.

REGEX_MATCH({Artist}, '[^\\x00-\\x7f]')