473,383 Members | 1,870 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,383 software developers and data experts.

Seeking optimization advice

Hello all,

I have written an expression interpreter, and a profiler has told me that a
specific part of it is in need of optimization. My purpose in posting to
this newsgroup is to solicit suggestions on how I might accomplish my
intended aim more efficiently.

My interpreter supports the "if-then-else" construct. In addition, it
supports variables, which means it has a symbol table. As you probably
suspect, the symbol table is implemented using std::map<std::string,
var_value_t>.

Currently, the processing of an if-then-else proceeds as follows:

1. Save the original symbol table state; say the returned handle to the
saved state is 1

2. Parse the boolean condition and store the result in rval (note that it is
not possible for the symbol table to be altered here; I do not allow
assignment in the condition of an if-then-else)

3. Parse the "then" subexpression

4. Save the "then" symbol table state; say the returned handle to the saved
state is 2

5. Restore symbol table state 1

6. Parse the "else" subexpression

7. Save the "else" symbol table state; say the returned handle to the saved
state is 3

8. If rval is true, restore symbol table state 2, else restore symbol table
state 3
The following serves as an example expression to which this processing could
be applied (assume 'a' is an integer initialized to 0):

if true then a := 6 else a := 7

Note that the "then" and "else" subexpressions can be arbitrarily complex;
they might even be another if-then-else. And so on recursively... This
fact eliminates some potential shortcuts.

What needs to be optimized is the saving and restoring of the symbol table
states. Currently, states are saved in a std::vector<>. When state is
requested to be saved, I just push_back() the whole symbol table and return
the index as the "handle" to the saved state. When the restoration of a
state is requested, I just index the std::vector<> with the passed handle
(index) and copy back the symbol table at that index. All of this copying
of std::map<> is taking a big toll performance-wise.

I have already tried adding an undo / redo feature that allows me to operate
only upon the actual symbol table itself. Unfortunately, I quickly
discovered that this is not robust enough. State is not properly
maintained. A careful run through of the example expression above
demonstrates this.

Can anybody suggest something more efficient than my current implementation?
It is a requirement that whatever I implement remain within the realm of
standard C++.

Thank you,
Dave
Jul 22 '05 #1
1 1512
"Dave" <be***********@yahoo.com> wrote
I have written an expression interpreter, and a
profiler has told me that a specific part of it is
in need of optimization.
Smart profiler. Mine just tells me the relative execution times
of various parts of my code. ;-)
My purpose in posting to this newsgroup is to
solicit suggestions on how I might accomplish my
intended aim more efficiently.

My interpreter supports the "if-then-else" construct.
In addition, it supports variables, which means it has
a symbol table. As you probably suspect, the symbol
table is implemented using std::map<std::string, var_value_t>.
And here's your first chance at an optimization: a hash table
would be quite a bit faster, especially for large symbol tables.
It's not by chance that most compilers and interpreters use hash
tables for their symbol tables. If your particular implementation
of the standard library doesn't include a hash_map (which is
universally believed to be slated for adoption in the next
standard), you can:

a) use STLport
b) use the Dinkumware Standard C++ Library
c) use any available implementation of a hash table
d) roll your own (hash tables are actually among the most trivial
of data structures)
Currently, the processing of an if-then-else proceeds as follows:
1. Save the original symbol table state; say the returned handle to the saved state is 1

2. Parse the boolean condition and store the result in rval (note that it is not possible for the symbol table to be altered here; I do not allow assignment in the condition of an if-then-else)

3. Parse the "then" subexpression

4. Save the "then" symbol table state; say the returned handle
to the saved state is 2

5. Restore symbol table state 1

6. Parse the "else" subexpression

7. Save the "else" symbol table state; say the returned handle
to the saved state is 3

8. If rval is true, restore symbol table state 2, else restore
symbol table state 3

You'd save yourself a lot of trouble if you decoupled the parsing
from the semantic analysis. If you did, you wouldn't need to save
various states of your symbol table (very time consuming) and you
could skip over the 'then' or 'else' clauses depending on the
evaluation of the condition without impacting the symbol table or
any other program state.
What needs to be optimized is the saving and restoring
of the symbol table states.
No, it needs to be eliminated.

Can anybody suggest something more efficient than
my current implementation?
Done.
It is a requirement that whatever I implement remain
within the realm of standard C++.


Nothing that I suggested departs in any way from the C++ standard
(the standard does NOT preclude using data structures other than
the standard containers), so you're all set.

Claudio Puviani
Jul 22 '05 #2

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

2
by: Dave | last post by:
Hello all, I have written an expression interpreter, and a profiler has told me that a specific part of it is in need of optimization. My purpose in posting to this newsgroup is to solicit...
2
by: Joseph Geretz | last post by:
I don't know if this is the right group for my question, but I'm seeking advice from knowledgable .NET developers. Hopefully I've come to the right place. I work with a document management...
20
by: Jim | last post by:
Hi, I am hoping that someone here can help me out. I am for the first time trying to implement a page design using only CSS instead of HTML tables. I've been able to get most of the page done...
15
by: Kay Schluehr | last post by:
I have a list of strings ls = and want to create a regular expression sx from it, such that sx.match(s) yields a SRE_Match object when s starts with an s_i for one i in . There might be...
4
by: | last post by:
I am a recent college graduate and am looking for some advice on how to be a skilled C++ developer. My educational background is from a quite mediocre campus. Can anybody please explain what is...
9
by: Amod | last post by:
hi all, Kindly suggest some tips to optimize the C++ code for higher speed. regards, Amod
7
by: Joseph Geretz | last post by:
I have a Service which runs OK, but I'm abviously not starting it properly. In my OnStart event I commence a long running process which polls a database table and performs various processing. Since...
3
by: Jia Lu | last post by:
Hello all I see there are lots of flat db or db-like modules in the standard python modules. What about the keywords seeking speed of them ? (I want to put about 10000 articles with 10000...
1
by: sukatoa | last post by:
The code below, //I need help here.... public void processChangedLines(int offset, int length) throws BadLocationException { String text = getText(); ResetColor(); ...
1
by: CloudSolutions | last post by:
Introduction: For many beginners and individual users, requiring a credit card and email registration may pose a barrier when starting to use cloud servers. However, some cloud server providers now...
0
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 3 Apr 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome former...
0
by: taylorcarr | last post by:
A Canon printer is a smart device known for being advanced, efficient, and reliable. It is designed for home, office, and hybrid workspace use and can also be used for a variety of purposes. However,...
0
by: aa123db | last post by:
Variable and constants Use var or let for variables and const fror constants. Var foo ='bar'; Let foo ='bar';const baz ='bar'; Functions function $name$ ($parameters$) { } ...
0
by: ryjfgjl | last post by:
If we have dozens or hundreds of excel to import into the database, if we use the excel import function provided by database editors such as navicat, it will be extremely tedious and time-consuming...
0
by: ryjfgjl | last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
by: Hystou | last post by:
There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.