Upper/lowercase regex matching in unicode

Jason Stitt

What's the best way to match uppercase or lowercase characters with a
regular expression in a unicode-aware way? Obviously [A-Z] and [a-z]
aren't going to cut it. I thought there were character classes of the
form ::upper:: or similar syntax, but can't find them in the docs.
Maybe I'm getting it mixed up with Perl regexen.

The upper() and lower() methods do work on accented characters in a
unicode string, so there has to be some recognition of unicode case
in there somewhere.

Thanks,

Jason

Oct 19 '05 #1

Subscribe Post Reply

6461

George Sakkis

"Jason Stitt" <ja***@pengale.com> wrote:

What's the best way to match uppercase or lowercase characters with a
regular expression in a unicode-aware way? Obviously [A-Z] and [a-z]
aren't going to cut it. I thought there were character classes of the
form ::upper:: or similar syntax, but can't find them in the docs.
Maybe I'm getting it mixed up with Perl regexen.

The upper() and lower() methods do work on accented characters in a
unicode string, so there has to be some recognition of unicode case
in there somewhere.

Thanks,

Jason

http://tinyurl.com/7jqgt

George

Oct 20 '05 #2

by: semovrs | last post by:

Hello, everyone! I would appreciate any input or advice on the following quite simple issue: If I search through a file list using grep -E '.*$' it will not pull files ending in JPG and files...

Perl

Lowercase/Uppercase for namespace-uri

by: R.Georges | last post by:

Hi, Someone knows if while matching a set of node for a namespace-uri, I must ignore lowercase/uppercase. For instance, I search "http://www.w3.org/1999/XSL/Transfrom" namespace node inside...

.NET Framework

Portable 'lowercase' function for stl string?

by: Steve Edwards | last post by:

Hi, I'm re-writing some code that had relied on some platform/third-party dependent utility functions, as I want to make it more portable. Is there a standard C/C++/stl routine for changing an stl...

C / C++

how to convert characters to upper case in utf8 env.

by: csanjith | last post by:

Hi, i have a situaion where i need to convert the characters entered in an text field to upper case using C. The configuration id utf8 environment in which user can enter any character (single ,...

C / C++

Regex: Finding strings in a source file

by: Bob | last post by:

I need to create a Regex to extract all strings (including quotations) from a C# or C++ source file. After being unsuccessful myself, I found this sample on the internet: ...

C# / C Sharp

Finding Upper-case characters in regexps, unicode friendly.

by: possibilitybox | last post by:

I'm trying to make a unicode friendly regexp to grab sentences reasonably reliably for as many unicode languages as possible, focusing on european languages first, hence it'd be useful to be able...

Python

Standard C Library regex performance issue

by: igor.kulkin | last post by:

I have a small utility program written in Python which works pretty slow so I've decided to implement it in C. I did some benchmarking of Python's code performance. One of the parts of the program...

C / C++

How to change the first character of Dim variable names to upper case

by: Academia | last post by:

I want to search for Dim and replace it with Dim That is, I want to change the first character of Dim variable names to upper case. I can't figure know to use Regular Expression to do that....

Visual Basic .NET

Regex Matching on Readline()

by: jwwest | last post by:

Anyone have any trouble pattern matching on lines returned by readline? Here's an example: string = "Accounting - General" pat = ".+\s-" Should match on "Accounting -". However, if I read...

Python

Easy Steps to Fix "Canon Printer Won't Connect to WiFi Network"

by: taylorcarr | last post by:

A Canon printer is a smart device known for being advanced, efficient, and reliable. It is designed for home, office, and hybrid workspace use and can also be used for a variety of purposes. However,...

General

How to turn on java script in a villaon keypad mobile phone

by: Charles Arthur | last post by:

How do i turn on java script on a villaon, callus and itel keypad mobile phone

Java

Batch import of multiple excel files into the database

by: ryjfgjl | last post by:

If we have dozens or hundreds of excel to import into the database, if we use the excel import function provided by database editors such as navicat, it will be extremely tedious and time-consuming...

Data Management

Migrating Website to Cloud - Emmanuel Katto

by: emmanuelkatto | last post by:

Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel

General

Navigating the Data Structures and Algorithms (DSA)

by: BarryA | last post by:

What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...

Algorithms / Advanced Math

Looking to do Android software development, any suggestions? Is flutter better?

by: nemocccc | last post by:

hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?

General

Is that possible of reading the .csv file in column wise and the column have different lengths ?

by: Sonnysonu | last post by:

This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...

C / C++

How to build RAID in BIOS?

by: Hystou | last post by:

There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...

Computer Hardware

Maximizing Business Potential: The Nexus of Website Design and Digital Marketing

by: jinu1996 | last post by:

In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...

Online Marketing

Upper/lowercase regex matching in unicode

Similar topics