Philosophical issue - problems of distinction between code and data

Steve Jorgensen

Hi folks,

I'm posting this message because it's an issue I come up against relatively
often, but I can't find any writings on the subject, and I haven't been able
to figure out even what key words one would use to look for it.

First, in broad philosophical terms, code actually -is- data. Code is the
data that's fed into a compiler, interpreter, or microprocessor that tells it
what to do. Code execution is, then, just a form another of data processing
handled by other code or hardware operating at the next lower level of
abstraction.

Now, in more concrete, database terms...

I recently came up with a scheme for representing how the names of various
lookup item names change, merge, and split over time in a set of data imported
periodically from an Excel spreadsheet exported form another application. The
troubles are, fist, that the scheme requires a large number of tables for each
lookup table to represent the mappings, and second, that there are a large
number of groups of tables that must have essentially the same structure.
Both of these are maintenance nightmares and represent a form of code
duplciation.

The next idea I had brings us back to the topic of this message. I thought,
what if I use just one table for all the lookup, and a type code to say what
kind of lookup value it contains? Now, we end up using the database in a way
it was not intended. We now have a type code which represnts data in one
sense, but is in another sense, part of the schema. This problem becomes
materialized when you try to define relationships, because now, with just one
lookup table for multiple types of lookup, there's no way to define the
relations correctly.

In this case, I ended up solving the problem by having both general and
specific lookup tables. The general table contains all lookup text and IDs,
and has a type code. Each specific table is a 1-to- with the general lookup
table, sharing the same ID, and also has the type code, but limits the allowed
value of the type code to the one code applicable to that table's type (it has
no text because that's available from the general lookup table). Now, a
compound relationship is created from the general lookup table to all the
specific lookup tables on both ID, and type code. The magic is that, now we
have specific lookup tables that we can set up proper relationships to, and a
central table containing all lookup IDs and text values that can be managed as
a single set, plus, the model enforces that both ways of identifying a record
type will agree.

This is a specific solution to a specific problem I may never have to deal
with again, but this class of problem shows up in different disguses all the
time. An example is, in VB when you want a group of items to be part of an
array or collection, but also refer to them as named properties. VB has no
built-in mechanism for this, so you have to manually maintain the correct
mapping between the 2 or use some kind of code generator to ensure that the
resulting code is consistent. In .NET, this particular problem is solved
using code attributes and introspection, so obviously I'm not the only one
thinking about this class of problem.

Does anyone know if there is a name for a topic that deals with the whole
class of issues around the ambiguity between code and data that we run into
when trying to improve the abstractions in our programs?

Nov 12 '05 #1

Subscribe Reply

1996

Pavel Romashkin

My level of expertise is definitely insufficient to follow the entire
thread of thought here but isn't OOP exactly the approach for dealing
with recycling the code and attaching it to various data assemblies? I
have been able to use class modules for every programming need I had so
far. Indeed, if you try to use only built in Access classes (like
Tables, which, if unchecked, eventually start to multiply like
cockroaches) you run into limitations with regards to what you can do
but I have yet to see a problem I couldn't solve using custom class modules.

Pavel

Steve Jorgensen wrote:

Hi folks,

I'm posting this message because it's an issue I come up against relatively
often, but I can't find any writings on the subject, and I haven't been able
to figure out even what key words one would use to look for it.

First, in broad philosophical terms, code actually -is- data. Code is the
data that's fed into a compiler, interpreter, or microprocessor that tells it
what to do. Code execution is, then, just a form another of data processing
handled by other code or hardware operating at the next lower level of
abstraction.

Now, in more concrete, database terms...

I recently came up with a scheme for representing how the names of various
lookup item names change, merge, and split over time in a set of data imported
periodically from an Excel spreadsheet exported form another application. The
troubles are, fist, that the scheme requires a large number of tables for each
lookup table to represent the mappings, and second, that there are a large
number of groups of tables that must have essentially the same structure.
Both of these are maintenance nightmares and represent a form of code
duplciation.

The next idea I had brings us back to the topic of this message. I thought,
what if I use just one table for all the lookup, and a type code to say what
kind of lookup value it contains? Now, we end up using the database in a way
it was not intended. We now have a type code which represnts data in one
sense, but is in another sense, part of the schema. This problem becomes
materialized when you try to define relationships, because now, with just one
lookup table for multiple types of lookup, there's no way to define the
relations correctly.

In this case, I ended up solving the problem by having both general and
specific lookup tables. The general table contains all lookup text and IDs,
and has a type code. Each specific table is a 1-to- with the general lookup
table, sharing the same ID, and also has the type code, but limits the allowed
value of the type code to the one code applicable to that table's type (it has
no text because that's available from the general lookup table). Now, a
compound relationship is created from the general lookup table to all the
specific lookup tables on both ID, and type code. The magic is that, now we
have specific lookup tables that we can set up proper relationships to, and a
central table containing all lookup IDs and text values that can be managed as
a single set, plus, the model enforces that both ways of identifying a record
type will agree.

This is a specific solution to a specific problem I may never have to deal
with again, but this class of problem shows up in different disguses all the
time. An example is, in VB when you want a group of items to be part of an
array or collection, but also refer to them as named properties. VB has no
built-in mechanism for this, so you have to manually maintain the correct
mapping between the 2 or use some kind of code generator to ensure that the
resulting code is consistent. In .NET, this particular problem is solved
using code attributes and introspection, so obviously I'm not the only one
thinking about this class of problem.

Does anyone know if there is a name for a topic that deals with the whole
class of issues around the ambiguity between code and data that we run into
when trying to improve the abstractions in our programs?

Nov 12 '05 #2

David W. Fenton

no****@nospam.n ospam (Steve Jorgensen) wrote in
<d1************ *************** *****@4ax.com>:

Does anyone know if there is a name for a topic that deals with
the whole class of issues around the ambiguity between code and
data that we run into when trying to improve the abstractions in
our programs?

I can't answer your real question, but it seems to me that you've
figured out a scenario where you are happy with the fundamental
idea behind my generic lookup table:

http://www.bway.net/~dfassoc/downloa...okupAdmin.html

I was thinking about this issue the other day and I generally think
that I'm not so concerned about enforcing referential integrity on
these kinds of lookups. Really. Indeed, I've not had a single
application where use of this generic lookup structure has resulted
in incorrect data being allowed into the lookup fields. Maybe this
is because I'm the only one writing apps against these
applications. Or maybe it's because my users never feel any need or
desire to edit tables directly because my apps give them all the
functionality they need.

In your scenario, I don't see any reason why you'd be bothered by
the structure at all. Groups of entities that can be modelled in
identical data structures are, at some level, identical entities.
There is an isomorphism that you can exploit to simplify a data
structure.

I think the reason why this is surprising or feels inappropriate is
that we too often confuse the entites in our data schema with the
real entities.

I long ago stopped using more than one table per application for
storing data about people, even though those tables are actually
holding more than one functional entity type (customer contacts,
employees, etc.). This has caused no problems at all, and has
vastly simplified any number of issues (e.g., providing search
functionality for multiple entity types).

I can't see a downside to your approach. Can you?

--
David W. Fenton http://www.bway.net/~dfenton
dfenton at bway dot net http://www.bway.net/~dfassoc

Nov 12 '05 #3

Steve Jorgensen

On Wed, 07 Jan 2004 20:21:23 GMT, dX********@bway .net.invalid (David W.
Fenton) wrote:

no****@nospam. nospam (Steve Jorgensen) wrote in
<d1*********** *************** ******@4ax.com> :
Does anyone know if there is a name for a topic that deals with
the whole class of issues around the ambiguity between code and
data that we run into when trying to improve the abstractions in
our programs?
I can't answer your real question, but it seems to me that you've
figured out a scenario where you are happy with the fundamental
idea behind my generic lookup table:

http://www.bway.net/~dfassoc/downloa...okupAdmin.html

Yes. In a sense, I've actually worked around a large part of the limitation
of that approach, but my work-around also reintroduces the clutter you were
trying to eliminate when you originally proposed it.
I was thinking about this issue the other day and I generally think
that I'm not so concerned about enforcing referential integrity on
these kinds of lookups. Really. Indeed, I've not had a single
application where use of this generic lookup structure has resulted
in incorrect data being allowed into the lookup fields. Maybe this
is because I'm the only one writing apps against these
applications . Or maybe it's because my users never feel any need or
desire to edit tables directly because my apps give them all the
functionalit y they need.
As you say on the Web page, you were dealing with small lookup tables,
unlikely to need to encompass additional functionality. I'm not sure I would
have chosen to use that solution for that case, but I wouldn't say it was a
wrong choice either. Kind of a 6 of 1, 1/2 dozen of the other sort of thing.
In your scenario, I don't see any reason why you'd be bothered by
the structure at all. Groups of entities that can be modelled in
identical data structures are, at some level, identical entities.
There is an isomorphism that you can exploit to simplify a data
structure.
In analyzing whether to do it or not, I came up with lots of reasons to be
bothered by the structure, and my final, general/specific hybrid was a way to
mitigate those factors.

The main bothersome thing about this case is that the criteria for needing
central management was not the fact that these were mostly simple lookup
tables, but the fact that they all needed to be managed by the time-delta
management code. In fact, it turned out pretty quickly that one of the tables
involved would -not- be simpy a lookup table, but a table with about 30
fields. By having the central table for the time-delta-manageable instance of
the entity, and then having an indepentent table representation of each entity
type, I not only have a way to use proper DRI, but a place to put additional
columns relating only to specific entity types.

In more general terms, you could say I had an OOP proplem in a SQL database
schema. One entity is very different things in different contexts, and things
A, B, C, etc are all also thing Q even though A, B, and C are all very
different from each other.
I think the reason why this is surprising or feels inappropriate is
that we too often confuse the entites in our data schema with the
real entities.

I long ago stopped using more than one table per application for
storing data about people, even though those tables are actually
holding more than one functional entity type (customer contacts,
employees, etc.). This has caused no problems at all, and has
vastly simplified any number of issues (e.g., providing search
functionalit y for multiple entity types).
I think I agree with your answer, but not with your analysis. I say People
are really a single set, and customers, contacts, employees, etc., are roles
that people can have. To me, it seems like People are the same entity type at
both levels, and logically belong in the same set, not just the same table.
Wouldn't it be a mutable business rule whether a single Person record should
be allowed represent be both an Employee and a Customer (should attribure
changes on a sinlge person be reflected in both places)? If so, the design
probably should not treat them as part of different sets in any sense.
I can't see a downside to your approach. Can you?

At least 4 downsides remain (all fairly minor):

1. Each entity's name must have the same type, maximum size, and data entry
constraints.
2. It is not possible to use the entity's name as part of a unique
contstraint involving other fields, contained in the specific entity table
since the name is in the general-purpose table.
3. Everywhere in code that I add a row to any of these tables, I now have to
add the record to 2 tables, the general and the specific. Likewise, when I
delete, I need to make sure to delete the specific first, then the general
(can use cascade delete, but still need to make sure to delete from the
general table).
4. DRI ensures that I can't have a specific record without the general, but
it's still possible to have a general record without the specific, even though
that's an inconsistent state from the application point of view.

Nov 12 '05 #4

Steve Jorgensen

On Wed, 07 Jan 2004 09:35:25 -0700, Pavel Romashkin
<pa************ *@hotmail.com> wrote:

My level of expertise is definitely insufficient to follow the entire
thread of thought here but isn't OOP exactly the approach for dealing
with recycling the code and attaching it to various data assemblies? I
have been able to use class modules for every programming need I had so
far. Indeed, if you try to use only built in Access classes (like
Tables, which, if unchecked, eventually start to multiply like
cockroaches) you run into limitations with regards to what you can do
but I have yet to see a problem I couldn't solve using custom class modules.

After my reply to David, I realize that you are exactly right. OOP does solve
a large number of problems in this category, though clearly not all of them.
In this case, I was actually solving a polymorphism problem, but had to do it
in the context of a relational database schema where OOP constructs are not
available.

Nov 12 '05 #5

Peter Miller

On Wed, 07 Jan 2004 23:02:40 GMT, Steve Jorgensen
<no****@nospam. nospam> wrote in comp.databases. ms-access:

After my reply to David, I realize that you are exactly right. OOP does solve
a large number of problems in this category, though clearly not all of them.
In this case, I was actually solving a polymorphism problem, but had to do it
in the context of a relational database schema where OOP constructs are not
available.

Steve, perhaps its time for you to look into an object-relational
database systems...

Peter Miller
_______________ _______________ _______________ _______________
PK Solutions -- Data Recovery for Microsoft Access/Jet/SQL
Free quotes, Guaranteed lowest prices and best results
www.pksolutions.com 1.866.FILE.FIX 1.760.476.9051

Nov 12 '05 #6

XMVP

"Steve Jorgensen" <no****@nospam. nospam> wrote in message
news:d1******** *************** *********@4ax.c om...

Hi folks,

I'm posting this message because it's an issue I come up against relatively often, but I can't find any writings on the subject, and I haven't been able to figure out even what key words one would use to look for it.

First, in broad philosophical terms, code actually -is- data. Code is the
data that's fed into a compiler, interpreter, or microprocessor that tells it what to do. Code execution is, then, just a form another of data processing handled by other code or hardware operating at the next lower level of
abstraction.

Now, in more concrete, database terms...

I recently came up with a scheme for representing how the names of various
lookup item names change, merge, and split over time in a set of data imported periodically from an Excel spreadsheet exported form another application. The troubles are, fist, that the scheme requires a large number of tables for each lookup table to represent the mappings, and second, that there are a large
number of groups of tables that must have essentially the same structure.
Both of these are maintenance nightmares and represent a form of code
duplciation.

The next idea I had brings us back to the topic of this message. I thought, what if I use just one table for all the lookup, and a type code to say what kind of lookup value it contains? Now, we end up using the database in a way it was not intended. We now have a type code which represnts data in one
sense, but is in another sense, part of the schema. This problem becomes
materialized when you try to define relationships, because now, with just one lookup table for multiple types of lookup, there's no way to define the
relations correctly.

In this case, I ended up solving the problem by having both general and
specific lookup tables. The general table contains all lookup text and IDs, and has a type code. Each specific table is a 1-to- with the general lookup table, sharing the same ID, and also has the type code, but limits the allowed value of the type code to the one code applicable to that table's type (it has no text because that's available from the general lookup table). Now, a
compound relationship is created from the general lookup table to all the
specific lookup tables on both ID, and type code. The magic is that, now we have specific lookup tables that we can set up proper relationships to, and a central table containing all lookup IDs and text values that can be managed as a single set, plus, the model enforces that both ways of identifying a record type will agree.

This is a specific solution to a specific problem I may never have to deal
with again, but this class of problem shows up in different disguses all the time. An example is, in VB when you want a group of items to be part of an array or collection, but also refer to them as named properties. VB has no built-in mechanism for this, so you have to manually maintain the correct
mapping between the 2 or use some kind of code generator to ensure that the resulting code is consistent. In .NET, this particular problem is solved
using code attributes and introspection, so obviously I'm not the only one
thinking about this class of problem.

Does anyone know if there is a name for a topic that deals with the whole
class of issues around the ambiguity between code and data that we run into when trying to improve the abstractions in our programs?

Thanks for the update on medical marijuana use!

Nov 12 '05 #7

Steve Jorgensen

On Thu, 08 Jan 2004 00:31:51 GMT, Peter Miller <pm*****@pksolu tions.com>
wrote:

On Wed, 07 Jan 2004 23:02:40 GMT, Steve Jorgensen
<no****@nospam .nospam> wrote in comp.databases. ms-access:
After my reply to David, I realize that you are exactly right. OOP does solve
a large number of problems in this category, though clearly not all of them.
In this case, I was actually solving a polymorphism problem, but had to do it
in the context of a relational database schema where OOP constructs are not
available.

Steve, perhaps its time for you to look into an object-relational
database systems...

Yeah, I'm starting to realize that, but in this case, there was exactly one
type of polymorphism problem and I found a solution. My post here was really
a question to see if anyone knew of resources or search terms about the
broader topic of data vs code represenation of rules and structure in
applications, and ways to handle cases where some aspect of the app is better
handled as code or schema structure in for some purposes, and as data for
other purposes.

Now that I look at it, though, I think I got it. Most of what I'm looking at
is really in the category of polymorphism, and OO is the only model we really
have for dealing with that.

Nov 12 '05 #8

XDPM

"XMVP" <ac***********@ hotmail.com> wrote

Thanks for the update on medical marijuana use!

Just as expected from an access_moron. Better go spend some time at
http://www.donny.com and get your head straight.

XDPM

Nov 12 '05 #9

Similar topics

1553

Philosophical Programmin Question

by: dracolytch | last post by:

Alright, this is a controvercial question for my fellow gurus: Why do we bother with Object-Oriented programming in a web environment? The web, by nature, is a stateless environment. Web-based applications are, on the whole, fairly uncomplicated things. Objects are, generally, conceptually thought of as entities that have a lifespan. Having...

PHP

1561

Inheritance design issue

by: Mark O'Flynn | last post by:

I would like some advice regarding implementing inheritance using vb.net. I have an application written in vb6 that I am rewritting from the ground up in vb.net to take full advantage of .net, but I have encountered a design issue implementing inheritance. To explain my question, I will use an example. I have a Person class, with...

Visual Basic .NET

2347

Philosophical reason behind the concept of friend function

by: Madhav | last post by:

Hi all, I am a newbie in c++. I want to know what is the philosophical reason behind the existence of friend functions. I thought giving access to private data to a function which is not a member of the class is a violation of encapsulation. Thanks, Madhav.

C / C++

1552

Philosophical question about separating tiers

by: Ronald S. Cook | last post by:

We have a Windows app which contains UI code and all classes that perform business logic and make calls to database stored procs (located on a database server - i.e. not local to the app). My boss wants to bring all those classes to a business server and out of each instance of the Windows application (i.e. separate into a business tier). ...

C# / C Sharp

2109

What is the "philosophical" meaning of const?

by: JiÅ™Ã PaleÄek | last post by:

Hello, I know the rules for const handling in C++, but I'd like to ask what is the "right" way to use them, eg. when is it appropriate to make a member function const? This came across this question when I was thinking about implementation of a class implementing some lazy data structure or cache. The C++ rules allow among other...

C / C++

2584

Wizard Control Issue

by: raghav | last post by:

Hi I am working on ASP.NET 2.0. I am developing a website using Wizard control. Based on number of steps added, next, previous, finish buttons generate automatically. After running the application, these button work automatically, means we can go to next step, previous step by clicking on corresponding buttons. In my application I have 3...

ASP.NET

1954

N-tier application - Design issue

by: mehdi_mousavi | last post by:

Hi folks, In an N-tier application, what is the possible values of N??? I'm not kidding, I just interviewed with a programmer today, and he started the "2-tier application" conversation. From then on, I heard about "5-tier application" and then "7-tier application".... I do know what is a 3-tier application, but what about 2, 5, and 7??? Does...

C# / C Sharp

2068

Can there be problems when a class is multiply inherited from 1000s of classes

by: Janaka Perera | last post by:

Hi All, We have done a object oriented design for a system which will create a class multiply inherited by around 1000 small and medium sized classes. I would be greatful if you can help me with the following: Can this create any problems with GNU g++ compilation, linking etc? What would be the impact to the performance of the system?...

C / C++

1767

Tricky Request Filtering Issue

by: Joseph Geretz | last post by:

I'm using the Request Filter documentation which can be found here: http://msdn.microsoft.com/en-us/library/system.web.httprequest.filter.aspx In this example, two filters are installed, one filter uppercases every alphabetic character and then the second filter replaces every 'E' with "#'. OK, very nice. What's trivial about these...

ASP.NET

7789

Changing the language in Windows 10

by: Hystou | last post by:

Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can effortlessly switch the default language on Windows 10 without reinstalling. I'll walk you through it. First, let's disable language...

Windows Server

8301

Maximizing Business Potential: The Nexus of Website Design and Digital Marketing

by: jinu1996 | last post by:

In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven tapestry of website design and digital marketing. It's not merely about having a website; it's about crafting an immersive digital experience that...

Online Marketing

7894

The easy way to turn off automatic updates for Windows 10/11

by: Hystou | last post by:

Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows Update option using the Control Panel or Settings app; it automatically checks for updates and installs any it finds, whether you like it or not. For...

Windows Server

8169

Discussion: How does Zigbee compare with other wireless protocols in smart home applications?

by: tracyyun | last post by:

Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each protocol has its own unique characteristics and advantages, but as a user who is planning to build a smart home system, I am a bit confused by the...

General

5361

Couldn’t get equations in html when convert word .docx file to html file in C#.

by: conductexam | last post by:

I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and then checking html paragraph one by one. At the time of converting from word file to html my equations which are in the word document file was convert...

C# / C Sharp

3803

Trying to create a lan-to-lan vpn between two differents networks

by: TSSRALBI | last post by:

Hello I'm a network technician in training and I need your help. I am currently learning how to create and manage the different types of VPNs and I have a question about LAN-to-LAN VPNs. The last exercise I practiced was to create a LAN-to-LAN VPN between two Pfsense firewalls, by using IPSEC protocols. I succeeded, with both firewalls in...

Networking - Hardware / Configuration

3820

Windows Forms - .Net 8.0

by: adsilva | last post by:

A Windows Forms form does not have the event Unload, like VB6. What one acts like?

Visual Basic .NET

1400

How to add payments to a PHP MySQL app.

by: muto222 | last post by:

How can i add a mobile payment intergratation into php mysql website.

PHP

1132

Comprehensive Guide to Website Development in Toronto: Expert Insights from BSMN Consultancy

by: bsmnconsultancy | last post by:

In today's digital era, a well-designed website is crucial for businesses looking to succeed. Whether you're a small business owner or a large corporation in Toronto, having a strong online presence can significantly impact your brand's success. BSMN Consultancy, a leader in Website Development in Toronto offers valuable insights into creating...

General