foreach enhancement

cody

What about an enhancement of foreach loops which allows a syntax like this:

foreach(int i in 1..10) { } // forward
foreach(int i in 99..2) { } // backwards
foreach(char c in 'a'..'z') { } // chars
foreach(Color c in Red..Blue) { } // using enums

It should work with all integral datatypes. Maybe we can go a step further:

foreach(int i in 1..10, 30..100) { } // from 1 to 10 and from 30 to 100

And maybe we could also try to enhance the loop further and make it usable
for floating point types too:

foreach(float f in 1.0..9.5 : 0.5) { } // from 1.0 to 9.5 in steps of 0.5

This is far more readable than a for loop and makes the intent much clearer.
I always have to stop and think when I write loops like
"for (int i=list.Count-1; i>=0; i--)".
This is unproductive, error-prone and hard to maintain. Thus, my proposal.
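
For comparison, here is a rough sketch of the plain loops that the proposed ranges would stand for in C# as it exists today. The names RangeLoops, Expand and Letters are made up for illustration, and the sketch assumes that both bounds are inclusive and that the direction is implied by the order of the operands; the proposal itself does not spell out either point.

```csharp
using System.Collections.Generic;
using System.Text;

static class RangeLoops
{
    // Hypothetical helper: what "foreach (int i in from..to)" might mean,
    // assuming inclusive bounds. Not safe for bounds at int.MaxValue or
    // int.MinValue, where the counter would overflow.
    public static List<int> Expand(int from, int to)
    {
        var result = new List<int>();
        if (from <= to)
        {
            for (int i = from; i <= to; i++) result.Add(i);   // forward, e.g. 1..10
        }
        else
        {
            for (int i = from; i >= to; i--) result.Add(i);   // backwards, e.g. 99..2
        }
        return result;
    }

    // The char case, 'a'..'z', is the same loop over a char counter.
    public static string Letters(char from, char to)
    {
        var sb = new StringBuilder();
        for (char c = from; c <= to; c++) sb.Append(c);
        return sb.ToString();
    }
}
```

The backwards branch is exactly the loop that is easy to get wrong by hand, which is the point of the proposal.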

--
cody

[Freeware, Games and Humor]
www.deutronium.de.vu || www.deutronium.tk

Nov 16 '05


Daniel O'Connell [C# MVP]

"cody" <pl*************************@gmx.de> wrote in message
news:%2****************@TK2MSFTNGP11.phx.gbl...

3b) For arrays all values have to be of the same type. Array literals take
on the numeric behaviour where the largest necessary type is used, but
arrays cannot contain multiple types.
We also could say that for objects we make the array of a type that is the
common base class of *all* of the array members and supports all interfaces
which *all* objects in the array support:

{string, object} // array type is object
{IList, Array} // array type is object supporting IList
{Control, TextBox} // array type is Control

Sounds right.
If the array contains value types *and* reference types, the type of the
array is object.
If only numeric types are involved I agree that the array should be of the
biggest necessary numeric type.
4) For enumerableExpression, every value is read out *unless* the expression
is cast to object or somesuch. This does leave a problem in that there is no
way to generate a list of IEnumerables, but if someone wants that edge case
they should probably use long-hand list or array syntax. Can you think of
any workarounds for this?
enumerableExpression is typed either object for IEnumerable or T for
IEnumerable<T>.

I do not like that idea. It would mean that you cannot put any object
supporting IEnumerable into the array without splitting it up into its
items. Better would be IEnumerator instead of IEnumerable, which means
you'll have to put list.GetEnumerator() into the array so that it is split
up. But I still do not like that idea. Arrays have a fixed size, and allowing
IEnumerators to be inserted into array literals means that at runtime
you have to create temporary lists and copy from them into the destination
array, since you cannot resize it.
E.g.:

{obj1, obj2, list1}

Assuming you want to guarantee that the expressions are evaluated in the
order they appear in the declaration, you first have to create a temporary
list. You put obj1 and obj2 in it. Then you have to evaluate list1 and add
its contents to the list. After you've evaluated all objects you can call
.ToArray() and your array is constructed.
But again, I do not like the idea of expanding IEnumerables. If I declare an
array literal containing 3 elements I expect that it really has 3 elements
and not more.
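
The temporary-list steps described above can be sketched as follows. This is only an illustration of the cost being objected to, not anyone's actual implementation: ArrayLiteralSketch and Build are hypothetical names, and the sketch assumes the literal's elements are evaluated left to right and that enumerable elements are expanded.

```csharp
using System.Collections;
using System.Collections.Generic;

static class ArrayLiteralSketch
{
    // Hypothetical lowering of the literal {obj1, obj2, list1}.
    public static object[] Build(object obj1, object obj2, IEnumerable list1)
    {
        var temp = new List<object>();   // final length is not known up front
        temp.Add(obj1);
        temp.Add(obj2);
        foreach (object item in list1)   // expand list1 into its items
        {
            temp.Add(item);
        }
        return temp.ToArray();           // copy into the fixed-size array
    }
}
```

A literal written with three elements can therefore end up with any number of entries, which is the surprise the paragraph above complains about.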

The problem here is that 1...3 *would* basically be an
IEnumerable\IEnumerator. It is a sequence generator which is expressed by
enumerable objects in the runtime. Making all of this work without breaking
the rules of the language is pretty tough. I've been considering a new
operator, <-, which assigns each value in an enumerable object to a variable
5) sourceExpression is any expression that returns a type, be it
mathematical or a method call. These aren't literals in a very strict sense;
every one of these values is generated at run time (unless compile-time
generation is possible).

agreed.
6) Where possible, foreach over a list or generator will generate a for
loop. This will only occur if the list generates integers or other
structures that support > <, etc. If the compiler cannot determine a valid
for loop or if code complexity would become too great (too many ranges or
multiple types), the compiler will instead generate the list and foreach
will operate over that list's IEnumerable implementation.

You are talking about set literals now? We should clearly separate the short
array literal syntax and the multi-range-set-literal syntax; they have
nothing to do with each other.

I'm talking about optimization of range literals. For the case foreach (int i in
1...1000) I see no reason to generate an object; a for loop like the one you
suggested can be generated instead. However, for code like
foreach(int i in 1...1000, 1000...1, 2...5 : .5, 3...5838) a generator
object would probably make more sense, as it would *vastly* simplify the
output code and probably the break statement as well.
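
As a sketch of the two code shapes being contrasted, written by hand in today's C# rather than produced by any compiler: the simple range becomes a counting loop, while the multi-range case is folded into one iterator. RangeLowering, SumSimple and Combined are hypothetical names, and the fractional "2...5 : .5" range is left out because it does not fit an int loop variable.

```csharp
using System.Collections.Generic;

static class RangeLowering
{
    // Simple case: "foreach (int i in 1...1000)" as a plain for loop,
    // with no enumerator object allocated.
    public static long SumSimple()
    {
        long sum = 0;
        for (int i = 1; i <= 1000; i++) sum += i;
        return sum;
    }

    // Complex case: one generator standing in for
    // "1...1000, 1000...1, 3...5838".
    public static IEnumerable<int> Combined()
    {
        for (int i = 1; i <= 1000; i++) yield return i;
        for (int i = 1000; i >= 1; i--) yield return i;
        for (int i = 3; i <= 5838; i++) yield return i;
    }
}
```

With the generator form, a break in the consuming foreach leaves every range at once, because there is only one loop at the call site.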

8) For contains checking, I'm still much more comfortable with isin over in,
so I will likely implement that keyword as it stands. However I will
consider adding a pragma that changes support to in so that either can be
experimented with.

I do not agree but you are the one who implements it :)

Yeah, there is a little bit of a boon to that sometimes, ;).

9) Experimental: If code contains the same range type twice only one
IEnumerable\IEnumerator class pair will be generated. That pair will take
parameters that determine ranges. This should cut down on class bloat.

Good idea. The bad thing is that generics do not support operator
overloading, which means we cannot use generics here and we must create a new
class for each type.

However, I still think that it is possible to implement these
sets/ranges without the creation of any classes/objects. It will maybe make
the implementation harder, but it will make the code faster and will not need
temporary objects or a bloat of classes.

"if (a in [0..100, 200..300])" should only be a substitute of
"if ((a>=0&&a<=100)||(a>=200&&a<=300))",

and "foreach (a in 0..100, 200..300)" should simply be compiled to
"for (int a=0;a<=100;a++){} for (int a=200;a<=300;a++){}"

But this is only an implementation detail; no programmer using the language
should be affected by how it is implemented.
A simple implementation as a proof of concept should be enough for now, and
hopefully our ideas will make it into regular C# someday :)
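
Written out as compilable C#, the two substitutions suggested above look roughly like this. RangeChecks, InRanges and CountIterations are hypothetical names used only to wrap the expressions; the loop body just counts iterations.

```csharp
static class RangeChecks
{
    // "if (a in [0..100, 200..300])" as plain comparisons, no objects created.
    public static bool InRanges(int a)
    {
        return (a >= 0 && a <= 100) || (a >= 200 && a <= 300);
    }

    // "foreach (a in 0..100, 200..300)" as two consecutive for loops.
    public static int CountIterations()
    {
        int count = 0;
        for (int a = 0; a <= 100; a++) count++;
        for (int a = 200; a <= 300; a++) count++;
        return count;
    }
}
```

One caveat with the two-loop form: a break written inside the body would only leave the first loop, so a real lowering would presumably need a flag or a jump to skip the second one.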

In many situations, the shortcuts will work (and I will use them where
possible). However, as an underlying foundation each form of syntax does
return a value and exists as an expression unto itself:
1...1000 is "A sequence that runs from the number one to the number 1000,
inclusive" and is encapsulated in an IEnumerable (I'm not using IEnumerator
due to IEnumerator<T> not supporting Reset. To have a reusable set you need
to be able to reset the resultant enumerator, thus GetEnumerator must be
accessible).
[1...1000] is "A list generated from an enumerable sequence".
{1...1000} is "An array generated from an enumerable sequence".

As such, each can stand alone. The stand alone objects can then be used in
other situations, vastly expanding (potential) usage.
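
A minimal sketch of how the three forms could relate, assuming the range is represented by an ordinary enumerable class. IntRange, RangeDemo, AsList and AsArray are hypothetical names, not part of the implementation discussed here.

```csharp
using System.Collections;
using System.Collections.Generic;

// Hypothetical stand-in for what "1...1000" might evaluate to.
sealed class IntRange : IEnumerable<int>
{
    private readonly int first;
    private readonly int last;

    public IntRange(int first, int last)
    {
        this.first = first;
        this.last = last;
    }

    // Each call starts a fresh pass, so the range is reusable without Reset.
    public IEnumerator<int> GetEnumerator()
    {
        for (int i = first; i <= last; i++) yield return i;
    }

    IEnumerator IEnumerable.GetEnumerator()
    {
        return GetEnumerator();
    }
}

static class RangeDemo
{
    // [1...1000]: a list generated from the sequence.
    public static List<int> AsList(IntRange r) { return new List<int>(r); }

    // {1...1000}: an array generated from the sequence.
    public static int[] AsArray(IntRange r) { return new List<int>(r).ToArray(); }
}
```

Because the range is an object in its own right, it can be passed around, stored, or enumerated several times, which is what makes the stand-alone usage possible.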

Since this is a proof-of-concept implementation, I am also looking at
slightly more complicated subjects like Haskell- and Python-like list
comprehensions (which I think effectively become mini-iterators). My existing
implementation ideas are too unpolished to post right now (6 pages of
rambling is too much), but the current syntax allows you to do things like
sequence unions and intersections, value mutation, value analysis, etc.
Some rough samples:
[1...1000 where value%2 == 0] // returns all even numbers between 1 and 1000, inclusive
[yield Math.Pow(value,2) for 1...1000 where Math.Pow(value,2)%2 == 0]
// returns the square of each value which has an even square

But, as I said, these are *very* rough and I'm not sure how or if I'll
bother. I do like the concept though, and they are quite popular in Python.
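
To illustrate the "mini-iterator" reading of the two samples, here is a hand-written sketch using ordinary C# iterator methods. Comprehensions, Evens and EvenSquares are hypothetical names, and this is only one guess at what such a comprehension might expand to.

```csharp
using System;
using System.Collections.Generic;

static class Comprehensions
{
    // [1...1000 where value%2 == 0]
    public static IEnumerable<int> Evens(int first, int last)
    {
        for (int value = first; value <= last; value++)
        {
            if (value % 2 == 0) yield return value;
        }
    }

    // [yield Math.Pow(value,2) for 1...1000 where Math.Pow(value,2)%2 == 0]
    public static IEnumerable<double> EvenSquares(int first, int last)
    {
        for (int value = first; value <= last; value++)
        {
            double square = Math.Pow(value, 2);   // computed once, used twice
            if (square % 2 == 0) yield return square;
        }
    }
}
```

Both methods produce their values lazily, one at a time, as the caller's foreach asks for them.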

I'll post more on syntax reasoning if anyone is interested.

Nov 16 '05 #51

Daniel O'Connell [C# MVP]

The problem here is that 1...3 *would* basically be an
IEnumerable\IEnumerator. It is a sequence generator which is expressed by
enumerable objects in the runtime. Making all of this work without
breaking the rules of the language is pretty tough. I've been considering
a new operator, <-, which assigns each value in an enumerable object to a
variable

Err, I didn't finish that.

I've been considering it but I can't find a clean way to add it. Adding
enumerable objects' contents to lists and arrays is vital, and it is vital to
do it with IEnumerable. Remember that there is *no* magic in a compiler,
at least not in a C-based language compiler. Everything has to work out in
the end.

How do you express one without the other?

Nov 16 '05 #52

Daniel O'Connell [C# MVP]

"Michael C" <mi*******@optonline.net> wrote in message
news:ON**************@TK2MSFTNGP11.phx.gbl...

"cody" <pl*************************@gmx.de> wrote in message
news:ef**************@tk2msftngp13.phx.gbl...

This would all certainly be possible, without any problem. Such a sequence
class can be written in half an hour, but that is not the point here. The
idea was to invent a new language feature which allows the programmer to loop
over multiple ranges and test whether a value exists in a range with a
simple built-in syntax and *without* creating any temporary objects.

Ahh, I thought the point was to be able to create sequences that would allow
you to iterate over all members of the sequence/range, or multiple
sequences/ranges, using foreach.

As cody said, the original goal was to make for loops easier to write, in
essence. It's not quite the same because you can't manually advance i, but
still, that was the goal.
The goal grew, for me anyway, to permit the language to succinctly express
lists and some list operations without using explicit classes. Much of what I
want to see isn't expressly possible with existing syntax, and having to
write a method to do it for each situation is overkill (one method is 10
minutes maybe, but if you have to do it a hundred times a month?). A good
deal of what I'm looking at and driving towards is Python- and Haskell-like
lists and list comprehensions.

I didn't realize the only acceptable option was a language re-design. I
suppose I don't see the benefit of changing the language syntax in order to
accomplish something that can be done in half an hour.
Language syntax extension isn't the only acceptable solution (although it is
for the original concept). However, I don't think there is any harm in
exploring new syntax that would ease certain concepts in the language. After
all, there are quite a few things you don't *have* to have in the language
that are quite useful, like using, foreach, lock, switch, break, continue,
goto, else, etc. They make the language easier even though you could use
other constructs to achieve the same effective result, if not the exact code.

Part of the purpose of these implementations is to suggest ideas and examine
them by using them. This allows you to really get an idea of how the code
would look and work in real scenarios. I've written a number of
modifications to the language: some were throwaway, some were 'just seeing
if I can', and others were things I would really like to see change in the
language. It's a hobby and an exercise, and maybe one of my ideas may filter
up and actually effect change. I don't think there is anything really wrong
with that, as long as the idea is a good one and is generally considered a
good one.

Considering I am going to implement this, I obviously support it. However, I
would and do appreciate any comments you have on the syntax as it stands (as
explained in other parts of this thread) as well as on its actual necessity,
even if they are negative. For what it's worth, I do happen to agree with you
that the syntax isn't absolutely vital, as everything I have planned can be
packaged up into a class (with anonymous methods for the list comprehensions).
I just think that readability, ease of authoring, and the optimization
potential warrant at least implementing and using the syntax for a while.
Hopefully an implementation will at least spark a little more discussion about
its merits, and either bring in some ground support or clearly inform us that
the community as a whole doesn't like it (as opposed to simply not bothering
to get into the discussion).
Best of luck with that.

Thanks,
Michael C.

Nov 16 '05 #53
