Bit mask - C / C++

Als

What's an efficient way to mask the last 3 bits of an 8-bit char and make them
all zero?

Bit-shifting is possible, but I'm not sure if it is efficient enough.

Example:

01011[010] --> 01011[000]

Thanks!

Nov 14 '05 #1


Joona I Palaste

Als <no****@nowhere.net> scribbled the following:

What's an efficient way to mask a last 3 bits of a 8-bit char and make them
all zero?

Bit-shifting is possible but not sure if it is efficient enough.

Example:

01011[010] --> 01011[000]

Thanks!

Well, a really simple way would be ANDing with ~7.
For example:
unsigned char x = 0x5A; /* the same as binary 01011010 */
x = x & ~7; /* clear last 3 bits */
Is it efficient? It depends on your implementation.
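
A minimal sketch of the two approaches mentioned so far, ANDing with ~7 and the shift-based alternative from the question. The helper names (clear_low3_mask, clear_low3_shift, check_clear_low3) are made up for illustration and are not from the thread; which form is faster, if either, depends on the compiler.

```c
#include <assert.h>

/* Hypothetical helper: clear the low 3 bits by ANDing with ~7. */
static unsigned char clear_low3_mask(unsigned char x)
{
    return (unsigned char)(x & ~7);
}

/* Hypothetical helper: the same result by shifting right, then left. */
static unsigned char clear_low3_shift(unsigned char x)
{
    return (unsigned char)((x >> 3) << 3);
}

/* Self-check using the example from the question: 01011010 -> 01011000. */
static void check_clear_low3(void)
{
    assert(clear_low3_mask(0x5A) == 0x58);
    assert(clear_low3_shift(0x5A) == 0x58);
}
```

Calling check_clear_low3() from main should pass silently if both helpers agree on the example value.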

--
/-- Joona Palaste (pa*****@cc.helsinki.fi) ------------- Finland --------\
\-- http://www.helsinki.fi/~palaste --------------------- rules! --------/
"C++ looks like line noise."
- Fred L. Baube III

Nov 14 '05 #2

Kevin Goodsell

Mark A. Odell wrote:

Joona I Palaste <pa*****@cc.helsinki.fi> wrote in
news:bt**********@oravannahka.helsinki.fi:
unsigned char x = 0x5A;

x &= (unsigned char) ~0x07;

It's practically the same thing, and whatever efficiency difference
there would be will probably get lost in compiler optimisation. I just
thought my version was clearer to read. Suit yourself.

I meant nothing WRT efficiency, just that I thought that &= is the more
common idiom and that you needed to cast ~0x07 to unsigned char.

Why would the cast be needed? It seems useless to me, since the result
will be promoted back to int (probably - unsigned int is theoretically
possible also) anyway.

-Kevin
--
My email address is valid, but changes periodically.
To contact me please use the address from a recent posting.

Nov 14 '05 #3

Als

Eric Sosman <Er*********@sun.com> wrote in message
news:3F***************@sun.com...

Als wrote:

What's an efficient way to mask a last 3 bits of a 8-bit char and make them all zero?

Bit-shifting is possible but not sure if it is efficient enough.

Example:

01011[010] --> 01011[000]

Many or perhaps even most C implementations use an
eight-bit `char', but that is not actually guaranteed
by the language, and implementations using wider `char'
are known to exist. Still:

unsigned char byte = 0x5A; /* 00...01011010 */

Is there any reason that you use "unsigned char" instead of "char" above?
Thanks!

Nov 14 '05 #4

pete

Als wrote:

Eric Sosman <Er*********@sun.com> wrote in message
news:3F***************@sun.com...
Als wrote:

What's an efficient way to mask a last 3 bits of a 8-bit char and make them all zero?

Bit-shifting is possible but not sure if it is efficient enough.

Example:

01011[010] --> 01011[000]

Many or perhaps even most C implementations use an
eight-bit `char', but that is not actually guaranteed
by the language, and implementations using wider `char'
are known to exist. Still:

unsigned char byte = 0x5A; /* 00...01011010 */

Is there any reason that you use "unsigned char"
instead of "char" above?

The result of a bitwise operation is implementation-defined
if the sign bit is set prior to or during the operation.

--
pete

Nov 14 '05 #5

Jack Klein

On 7 Jan 2004 18:33:24 GMT, "Mark A. Odell" <no****@embeddedfw.com>
wrote in comp.lang.c:

Joona I Palaste <pa*****@cc.helsinki.fi> wrote in
news:bt**********@oravannahka.helsinki.fi:
Example:

01011[010] --> 01011[000]

Thanks!

Well, a really simple way would be ANDing with ~7.
For example:
unsigned char x = 0x5A; /* the same as binary 01011010 */
x = x & ~7; /* clear last 3 bits */
Is it efficient? It depends on your implementation.

Why not do:

unsigned char x = 0x5A;

x &= (unsigned char) ~0x07;

--
- Mark ->

No gain, really. x will be promoted to either int or unsigned int,
and the numeric literal 0x5A, which has type int, will either be
unchanged or also promoted to unsigned int, before binary operation is
performed. Then the result will be converted back to unsigned char
for assignment back into x.

So casting the constant to unsigned char merely adds noise to the
source without changing a thing. If the constant were large enough
that it might actually be negative, a cast to (unsigned) could be
beneficial.
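
A rough way to check the claim above is to apply both forms to every unsigned char value and compare the results. This is only a sketch: the function name is hypothetical, and it assumes a two's complement int, where ~0x07 has the value -8.

```c
#include <assert.h>
#include <limits.h>

/* Returns 1 if the cast and uncast masks give the same result for every
   unsigned char value. Assumes two's complement, where ~0x07 is -8. */
static int cast_makes_no_difference(void)
{
    unsigned char v = 0;
    for (;;) {
        unsigned char a = v;
        unsigned char b = v;
        a &= (unsigned char)~0x07;  /* mask narrowed by the cast, then promoted again */
        b &= ~0x07;                 /* mask left as int */
        if (a != b)
            return 0;
        if (v == UCHAR_MAX)         /* stop before v would wrap around */
            break;
        v++;
    }
    return 1;
}
```

On such an implementation the function should return 1, which matches the point that the cast only adds noise.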

--
Jack Klein
Home: http://JK-Technology.Com
FAQs for
comp.lang.c http://www.eskimo.com/~scs/C-faq/top.html
comp.lang.c++ http://www.parashift.com/c++-faq-lite/
alt.comp.lang.learn.c-c++
http://www.contrib.andrew.cmu.edu/~a...FAQ-acllc.html

Nov 14 '05 #6

Anders Mikkelsen

Eric wrote:

Als <no****@nowhere.net> wrote:

What's an efficient way to mask a last 3 bits of a 8-bit char and make them
all zero?

unsigned char x = 90;

x = x & ~7;

I'm curious about using the ~ operator. Will the compiler recognise ~7
as a constant value, or will the compiled code include one's complement
instruction(s)?

Wouldn't this be better:

x &= 0xf8;

For me this is more readable than the above...
Regards,
Anders

Nov 14 '05 #7

pete

Anders Mikkelsen wrote:

Eric wrote:
Als <no****@nowhere.net> wrote:

What's an efficient way to mask a last 3 bits
of a 8-bit char and make them all zero?

unsigned char x = 90;

x = x & ~7;

I'm curious about using the ~ operator. Will the compiler recognise ~7
as a constant value,

It *is* a constant value.

or will the compiled code include one's complement
instruction(s)?

It can do that too, but I would expect that it wouldn't.

Wouldn't this be better:

x &= 0xf8;

For me this is more readable than the above...

That will mask an 8 bit char as OP specified,
but (x &= ~7) will work on a char of any width.
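
One way to see that ~7 is a constant as far as the language is concerned: it can appear where an integer constant expression is required, such as an enumerator value. The names below are hypothetical, and the sketch says nothing about which instructions a given compiler emits.

```c
#include <assert.h>

/* An enumerator value must be an integer constant expression,
   so this declaration compiles only because ~7 is one. */
enum { LOW3_CLEAR_MASK = ~7 };

/* Hypothetical helper that uses the named constant. */
static unsigned char clear_low3_named(unsigned char x)
{
    return (unsigned char)(x & LOW3_CLEAR_MASK);
}
```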

--
pete

Nov 14 '05 #8

Morris Dovey

Anders Mikkelsen wrote:

I'm curious about using the ~ operator. Will the compiler recognise ~7
as a constant value, or will the compiled code include one's complement
instruction(s)?

Hmm. Very dangerous to predict compiler behaviors - or did you
have a particular compiler in mind? It's as much a constant value
as -7 (which still says nothing about the compiler's behavior).

Wouldn't this be better:

x &= 0xf8;

For me this is more readable than the above...

Perhaps more readable for you; but what if CHAR_BIT isn't less
than nine bits? Imagine that this program is run on my DS9K
configured for 12-bit chars - what then?
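
The difference can be sketched by applying both masks to the all-ones value UCHAR_MAX. The helper names are hypothetical, and the ~7 form assumes a two's complement int. With 8-bit chars the two forms agree; with wider chars the 0xf8 form also clears every bit above bit 7.

```c
#include <assert.h>
#include <limits.h>

/* Width-independent: ~7 keeps every bit above the low three. */
static unsigned char clear_low3_any_width(unsigned char x)
{
    return (unsigned char)(x & ~7);
}

/* Width-dependent: 0xf8 also clears any bits above bit 7. */
static unsigned char clear_low3_8bit_only(unsigned char x)
{
    return (unsigned char)(x & 0xf8);
}

/* Returns 1 if the two forms agree for the all-ones value,
   which is the case exactly when CHAR_BIT is 8. */
static int forms_agree_on_max(void)
{
    return clear_low3_any_width(UCHAR_MAX) == clear_low3_8bit_only(UCHAR_MAX);
}
```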

--
Morris Dovey
West Des Moines, Iowa USA
C links at http://www.iedu.com/c
Read my lips: The apple doesn't fall far from the tree.

Nov 14 '05 #9

Ravi Uday

eg*************@verizon.net (Eric) wrote in message news:<1g772zd.1aoukoy5c8ubkN%eg*************@verizon.net>...

Als <no****@nowhere.net> wrote:
What's an efficient way to mask a last 3 bits of a 8-bit char and make them
all zero?

unsigned char x = 90;

x = x & ~7;

Is there anything wrong with this one ?

x = x & 0xf8;
- Ravi

Nov 14 '05 #10

Joona I Palaste

Ravi Uday <ra*****@yahoo.com> scribbled the following:

eg*************@verizon.net (Eric) wrote in message news:<1g772zd.1aoukoy5c8ubkN%eg*************@verizon.net>...
Als <no****@nowhere.net> wrote:
What's an efficient way to mask a last 3 bits of a 8-bit char and make them
all zero?
unsigned char x = 90;

x = x & ~7;

Is there anything wrong with this one ? x = x & 0xf8;

As the OP specified, no, but Eric's solution works for non-8 bit chars
too.

--
/-- Joona Palaste (pa*****@cc.helsinki.fi) ------------- Finland --------\
\-- http://www.helsinki.fi/~palaste --------------------- rules! --------/
"The large yellow ships hung in the sky in exactly the same way that bricks
don't."
- Douglas Adams

Nov 14 '05 #11

Mark A. Odell

Jack Klein <ja*******@spamcop.net> wrote in
news:gf********************************@4ax.com:

Why not do:

unsigned char x = 0x5A;

x &= (unsigned char) ~0x07;
No gain, really. x will be promoted to either int or unsigned int,
and the numeric literal 0x5A, which has type int, will either be
unchanged or also promoted to unsigned int, before binary operation is
performed. Then the result will be converted back to unsigned char
for assignment back into x.

So casting the constant to unsigned char merely adds noise to the
source without changing a thing. If the constant were large enough
that it might actually be negative, a cast to (unsigned) could be
beneficial.

I'm so used to 32-bit machines now maybe I'm tainted. Thanks Jack. So I
only need to cast when 'x' or the mask constant exceeds a value
representable in a signed int?

--
- Mark ->
--

Nov 14 '05 #12

pete

Jack Klein wrote:

unsigned char x = 0x5A;
No gain, really. x will be promoted to either int or unsigned int,
and the numeric literal 0x5A, which has type int, will either be
unchanged or also promoted to unsigned int, before binary operation is
performed. Then the result will be converted back to unsigned char
for assignment back into x.

There's two things wrong with that:
1 The left operand of the assignment operator doesn't get promoted.
2 Constants of type int, are not subject to integer promotions.

N869

6.5.16.1 Simple assignment
Semantics
[#2] In simple assignment (=), the value of the right
operand is converted to the type of the assignment
expression and replaces the value stored in the object
designated by the left operand.

6.5.16 Assignment operators
Semantics
[#3]
The type of an assignment expression is
the type of the left operand unless the left operand has
qualified type, in which case it is the unqualified version
of the type of the left operand.
6.3.1.1 Boolean, characters, and integers
[#2]
If an int can represent all values of the original type, the
value is converted to an int; otherwise, it is converted to
an unsigned int. These are called the integer
promotions.
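
The quoted clauses can be observed indirectly. In the sketch below (function names are hypothetical), the result of & on two unsigned char operands has the size of int because both operands are promoted first, and the value is converted back to unsigned char only when it is stored.

```c
#include <assert.h>

/* Returns 1 if unsigned char operands are promoted before & is applied:
   the result then has the size of int, not of unsigned char. */
static int and_result_is_int_sized(void)
{
    unsigned char x = 0x5A;
    return sizeof (x & x) == sizeof (int);
}

/* The promoted result is converted back to unsigned char on assignment. */
static unsigned char store_back(void)
{
    unsigned char x = 0x5A;
    x = x & ~7;
    return x;
}
```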

--
pete

Nov 14 '05 #13

pete

pete wrote:

Jack Klein wrote:
unsigned char x = 0x5A;
No gain, really. x will be promoted to either int or unsigned int,
and the numeric literal 0x5A, which has type int, will either be
unchanged or also promoted to unsigned int,
before binary operation is performed.
Then the result will be converted back to unsigned char
for assignment back into x.
There's two things wrong with that:

unless you were talking about

x &= (unsigned char) ~0x07;

.... which I now realize that you were.
Sorry about that.

--
pete

Nov 14 '05 #14

Christopher Benson-Manica

Eric <eg*************@verizon.net> spoke thus:

x = x & ~7;

Is ~7 guaranteed to be ...11111111111000 on all systems?

--
Christopher Benson-Manica | I *should* know what I'm talking about - if I
ataru(at)cyberspace.org | don't, I need to know. Flames welcome.

Nov 14 '05 #15

Eric Sosman

Christopher Benson-Manica wrote:

Eric <eg*************@verizon.net> spoke thus:
x = x & ~7;

Is ~7 guaranteed to be ...11111111111000 on all systems?

Yes. There's this pettifogging possibility, though,
that ...1111000 could be a trap representation for `int'
(as far as I know, the only platform for which this is
true is the Deathstation 9000, and then only on alternate
Thursdays when the moon is full). For 100% safety, you
could write `~7u' instead.
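
A short sketch of the `~7u' variant, with hypothetical helper names. For an unsigned operand the standard defines ~E as the maximum value of the type minus E, so the mask's value does not depend on how negative numbers are represented.

```c
#include <assert.h>
#include <limits.h>

/* Hypothetical helper: the mask is an unsigned int, so no sign bit is involved. */
static unsigned char clear_low3_unsigned(unsigned char x)
{
    return (unsigned char)(x & ~7u);
}

/* Returns 1 if ~7u equals the maximum unsigned int value minus 7. */
static int unsigned_mask_value_is_fixed(void)
{
    return ~7u == UINT_MAX - 7u;
}
```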

... and if that's the worst thing you need to worry
about, you are to be envied.

--
Er*********@sun.com

Nov 14 '05 #16

Alexander Bartolich

begin followup to Christopher Benson-Manica:

Is ~7 guaranteed to be ...11111111111000 on all systems?

Well, on a machine running trinary logic ... perhaps using three
charge states of an atom ... where the decimal number '7' is
represented by the digits '21' ... well, how the fuck is bitwise
negation meant to work there?

Perhaps like multiplication with -1, i.e. the zero 'trit' is left
unchanged while the outer two trit values are flipped. In that case
negating '21' results in '12' since leading zeros are not modified.

Yeah.

--
Für Google, Tux und GPL!

Nov 14 '05 #17

pete

Eric Sosman wrote:

Christopher Benson-Manica wrote:

Eric <eg*************@verizon.net> spoke thus:
x = x & ~7;

Is ~7 guaranteed to be ...11111111111000 on all systems?

Yes. There's this pettifogging possibility, though,
that ...1111000 could be a trap representation for `int'
(as far as I know, the only platform for which this is
true is the Deathstation 9000, and then only on alternate
Thursdays when the moon is full). For 100% safety, you
could write `~7u' instead.

I think that's best. As a matter of policy,
I prefer to avoid bitwise operations on signed types,
unless there is a special reason.

--
pete

Nov 14 '05 #18

Peter Nilsson

Eric Sosman <Er*********@sun.com> wrote in message news:<3F***************@sun.com>...

Christopher Benson-Manica wrote:

Eric <eg*************@verizon.net> spoke thus:
x = x & ~7;

Is ~7 guaranteed to be ...11111111111000 on all systems?

Yes. There's this pettifogging possibility, though,
that ...1111000 could be a trap representation for `int'
(as far as I know, the only platform for which this is
true is the Deathstation 9000, and then only on alternate
Thursdays when the moon is full). For 100% safety, you
could write `~7u' instead.

Well, if you think that ~ can change padding bits, then ~7u isn't safe
either, since even unsigned types can have padding, and hence,
potential trap representations (apart from the uintN_t types of
course).

A case to hypothetically consider would be if INT_MIN did not have a
magnitude of 2^N or 2^N-1. In other words, if you consider that an int
can have a bizarre range like (-78269..65318).

--
Peter

Nov 14 '05 #19

pete

Peter Nilsson wrote:

Eric Sosman <Er*********@sun.com> wrote in message news:<3F***************@sun.com>...
Christopher Benson-Manica wrote:

Eric <eg*************@verizon.net> spoke thus:

x = x & ~7;

Is ~7 guaranteed to be ...11111111111000 on all systems?

Yes. There's this pettifogging possibility, though,
that ...1111000 could be a trap representation for `int'
(as far as I know, the only platform for which this is
true is the Deathstation 9000, and then only on alternate
Thursdays when the moon is full). For 100% safety, you
could write `~7u' instead.

Well, if you think that ~ can change padding bits,
then ~7u isn't safe either, since even unsigned types can have
padding, and hence, potential trap representations
(apart from the uintN_t types of course).

He may have been referring to negative zero, instead of padding bits.

In C99 there's only 3 formats for representing negative integers,
but in C89, the representation for negative integer values
is only specified in broad terms relating to sign and value bits,
which would allow an implementation to define any particular
negative integer value representation, as negative zero.

--
pete

Nov 14 '05 #20

Richard Bos

Alexander Bartolich <al*****************@gmx.at> wrote:

begin followup to Christopher Benson-Manica:
Is ~7 guaranteed to be ...11111111111000 on all systems?

Well, on a machine running trinary logic ... perhaps using three
charge states of an atom ... where the decimal number '7' is
represented by the digits '21' ... well, how the fuck is bitwise
negation meant to work there?

Slowly. But it must, because the Standard requires it.

If you ever find a ternary computer with a C compiler, warn me, I could
do with a laugh.

Richard

Nov 14 '05 #21

Peter Nilsson

pete <pf******@mindspring.com> wrote in message news:<3F**********@mindspring.com>...

Peter Nilsson wrote:
Eric Sosman <Er*********@sun.com> wrote in message news:<3F***************@sun.com>...
Christopher Benson-Manica wrote:
Eric <eg*************@verizon.net> spoke thus:

x = x & ~7;

Is ~7 guaranteed to be ...11111111111000 on all systems?

Yes. There's this pettifogging possibility, though,
that ...1111000 could be a trap representation for `int'
(as far as I know, the only platform for which this is
true is the Deathstation 9000, and then only on alternate
Thursdays when the moon is full). For 100% safety, you
could write `~7u' instead.

Well, if you think that ~ can change padding bits,
then ~7u isn't safe either, since even unsigned types can have
padding, and hence, potential trap representations
(apart from the uintN_t types of course).

He may have been referring to negative zero, instead of padding bits.

In C99 there's only 3 formats for representing negative integers,
but in C89, the representation for negative integer values
is only specified in broad terms relating to sign and value bits,
which would allow an implementation to define any particular
negative integer value representation, as negative zero.

I'm not with you! How is ~7 a negative zero in C89? As I understood
it, there were no trap representations in C89, irrespective of the
representation chosen.

--
Peter

Nov 14 '05 #22

pete

Peter Nilsson wrote:

How is ~7 a negative zero in C89?

The C89 standard does not specify the representation
for the magnitudes of negative integer values.

--
pete

Nov 14 '05 #23

Peter Nilsson

"pete" <pf*****@mindspring.com> wrote in message
news:40***********@mindspring.com...

Peter Nilsson wrote:
How is ~7 a negative zero in C89?

The C89 standard does not specify the representation
for the magnitudes of negative integer values.

But the responses in DR#069, although rather cryptic (both the questions and
responses are rather carelessly mislabeled), seem to rule out the
possibility of ~7 being a 'negative zero'.

Given the committee's interpretation of "pure binary numeration system", I'm
not sure what other systems are actually allowed apart from the common 3
(padding bits and 'holes' notwithstanding).

That said, I'm more than happy to stick to the golden rule of not using
signed integers for bit manipulations whenever the sign bit is, or could
become, 1. :-)

--
Peter

Nov 14 '05 #24

pete

Peter Nilsson wrote:

"pete" <pf*****@mindspring.com> wrote in message
news:40***********@mindspring.com...
Peter Nilsson wrote:
How is ~7 a negative zero in C89?
The C89 standard does not specify the representation
for the magnitudes of negative integer values.

But the responses in DR#069,
although rather cryptic (both the questions and
responses are rather carelessly mislabeled), seem to rule out the
possibility of ~7 being a 'negative zero'.

I don't see anything there which would prohibit
1111 1111 1111 1000 from representing negative zero.

Given the committee's interpretation of
"pure binary numeration system", I'm
not sure what other systems are actually
allowed apart from the common 3
(padding bits and 'holes' notwithstanding).

The systems which are allowed in C89 are not specified by name,
so any system which uses the sign bit for negative values
followed by any systematic representation of the magnitudes,
would have been valid in C89.

--
pete

Nov 14 '05 #25
