Filter text.contents (removing special characters)

Hi guys,

I want to extract a string from a piece of text (a selection, for example). This text is XML-tagged.

If I do selection[0].contents, it captures the text plus all the special characters (XML tag markers, carriage returns). I can tell something is "wrong" because contents.length is greater than expected: "John Smith" should be 10 characters, but contents.length gives 14. I am not really surprised, as I already knew about this behaviour.

So I tried to filter it to remove anything that is not an alphanumeric character, but this is where I am stuck.

If I use a regular expression with contents.match(/[\w]+/g), it works almost perfectly. But if the contents include diacritics, this pattern fails to catch them.
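To illustrate what I mean, here is a minimal sketch. The sample string is made up, and I am assuming the tag markers show up as U+FEFF in the contents; that detail is not important for the point, which is that \w only covers A-Z, a-z, 0-9 and underscore.

```javascript
// Hypothetical stand-in for selection[0].contents:
// "José Müller" wrapped in assumed tag markers (U+FEFF) plus a carriage return.
var sample = "\uFEFFJos\u00E9 M\u00FCller\uFEFF\r";

// \w is ASCII-only in JavaScript regular expressions,
// so every accented letter splits the word it belongs to.
var words = sample.match(/[\w]+/g);

// words is ["Jos", "M", "ller"]: the "é" and "ü" are dropped.
var joined = words.join(" "); // "Jos M ller"
```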

I could add them to the pattern, but I would very likely miss a lot of them.
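For example, something along these lines. It is only a sketch: the ranges are my own pick (Latin-1 letters plus Latin Extended-A and B), the sample strings are made up, and it shows exactly the weakness I am worried about, since anything outside the listed ranges is still lost.

```javascript
// Hypothetical contents with assumed tag markers (U+FEFF) and a carriage return.
var sample = "\uFEFFJos\u00E9 M\u00FCller\uFEFF\r";

// ASCII letters and digits, plus accented Latin letters listed explicitly:
// U+00C0-U+00D6, U+00D8-U+00F6 (skipping the multiplication and division signs),
// and U+00F8-U+024F (rest of Latin-1, Latin Extended-A and Extended-B).
var LETTERS = /[A-Za-z0-9\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u024F]+/g;

var words = sample.match(LETTERS);
var clean = words.join(" "); // "José Müller"

// Anything outside the listed ranges is still dropped,
// e.g. a Greek capital omega (U+03A9) at the start of a word:
var missed = "\u03A9mega".match(LETTERS); // ["mega"]
```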

So my question is: how do I extract the pure text from the contents, making sure I get all the diacritics, if any, without carrying over the special characters?

TIA Loic
