473,394 Members | 1,696 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,394 software developers and data experts.

Need Opinions on DataSet usage

All,

I'm looking for opinions on the best way to accomplish a given task.
Here is the senario:

I have a Data Warehouse that I need to update with information that has been
loaded into a staging database. We are dealing only with three tables (a
source table, and the destination dimension and fact). I need to process
each "new" record in the source (staging) table, transform the data and then
update or insert into the dimension table and insert into the fact table.

Question:
Is it better to simply use a data reader to interate through each new record
and then on an individual basis using a command object simply update or
insert into the dimension/fact table as appropriate? Or is there a reason
to use the DataSet for all operations and if so what would be the strategy
for loading the Dataset table for the Dimension with both the rows to be
added and the rows to be updated? I don't want to go out and retrieve ALL
the dimension records which could be 100's of thousands of rows just to
locate the ones that need to be updated. Also I can't identify the ones to
be updated until some of the transformation process on the source data has
occurred.

Any thoughts suggestions appreciated,

JamesK.
Nov 20 '05 #1
2 1438
JLK,

Any reason for not using DTS?

Telmo Sampaio

"-=JLK=-" <No***********@nospam.com> wrote in message
news:Ox*************@TK2MSFTNGP09.phx.gbl...
All,

I'm looking for opinions on the best way to accomplish a given task.
Here is the senario:

I have a Data Warehouse that I need to update with information that has been loaded into a staging database. We are dealing only with three tables (a
source table, and the destination dimension and fact). I need to process
each "new" record in the source (staging) table, transform the data and then update or insert into the dimension table and insert into the fact table.

Question:
Is it better to simply use a data reader to interate through each new record and then on an individual basis using a command object simply update or
insert into the dimension/fact table as appropriate? Or is there a reason
to use the DataSet for all operations and if so what would be the strategy
for loading the Dataset table for the Dimension with both the rows to be
added and the rows to be updated? I don't want to go out and retrieve ALL the dimension records which could be 100's of thousands of rows just to
locate the ones that need to be updated. Also I can't identify the ones to be updated until some of the transformation process on the source data has
occurred.

Any thoughts suggestions appreciated,

JamesK.

Nov 20 '05 #2
Actually we do use DTS, though we have found it limiting in places. Using
VB.net is simply an alternative we are exploring for some situations. That
is what lead to my current question of which would be better the DataSet or
DataReader based on the given scenario. As may have been noticed in a
different post I made I was looking at the ability to "add" a row to the
dataset but fool it into thinking it was an update not an insert when the
key was already present in the target table, which would then preclude
having to retrieve the whole table to begin with.

Thanks,

James K.

"Telmo Sampaio" <te***********@hotmail.com> wrote in message
news:uV**************@tk2msftngp13.phx.gbl...
JLK,

Any reason for not using DTS?

Telmo Sampaio

"-=JLK=-" <No***********@nospam.com> wrote in message
news:Ox*************@TK2MSFTNGP09.phx.gbl...
All,

I'm looking for opinions on the best way to accomplish a given task.
Here is the senario:

I have a Data Warehouse that I need to update with information that has

been
loaded into a staging database. We are dealing only with three tables (a source table, and the destination dimension and fact). I need to process each "new" record in the source (staging) table, transform the data and

then
update or insert into the dimension table and insert into the fact table.
Question:
Is it better to simply use a data reader to interate through each new

record
and then on an individual basis using a command object simply update or
insert into the dimension/fact table as appropriate? Or is there a reason to use the DataSet for all operations and if so what would be the strategy for loading the Dataset table for the Dimension with both the rows to be
added and the rows to be updated? I don't want to go out and retrieve

ALL
the dimension records which could be 100's of thousands of rows just to
locate the ones that need to be updated. Also I can't identify the ones

to
be updated until some of the transformation process on the source data has occurred.

Any thoughts suggestions appreciated,

JamesK.


Nov 20 '05 #3

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

35
by: jerrygarciuh | last post by:
Hi all, I was just wondering what popular opinion is on PHP giving this warning: Warning: Invalid argument supplied for foreach() in /home/boogerpic/public_html/my.php on line 6 when...
181
by: Tom Anderson | last post by:
Comrades, During our current discussion of the fate of functional constructs in python, someone brought up Guido's bull on the matter: http://www.artima.com/weblogs/viewpost.jsp?thread=98196 ...
0
by: Koncept | last post by:
Sorry for asking, but I give up on this situation. I am a total n00b at Perl and have only used it for about 1 week now. I would really appreciate somebody's help here because I am really feeling...
3
by: Derrick | last post by:
I am reading in xml files that equate to sql tables, via XmlDataDocument, and then operating on the DataSet. With the most simple app that just loads the xml doc, I see the memory footprint of the...
3
by: ALI-R | last post by:
in the following statement why we have to give Dataset a name? what is it used for? Dataset myDataset =new Dataset("name_of_Dataset"); what is "name_of_Dataset" and what is its usage? thanks
2
by: Wysiwyg | last post by:
I was hoping to get some opinions on the efficiency of various methods of reusing the same dropdown list data. Here is the situation: Multiple panels on maintenance pages with TAB menus across...
9
by: SQLScott | last post by:
I have always heard that using IIF statements is not recommended because they are slower than using If..Then statements, and not as "clean". I would appreciate others opinions and why/why not use...
2
by: Walt | last post by:
I have a small xml document that I bring into my application by reading the xml file with a dataset: Dim ds As New DataSet ds.ReadXml(fileName) ' Do some work with the tables in the dataset...
0
by: c.w.browne | last post by:
Hi, Ive had a bit of a look around for other people with this problem and cant find anything that solves it in my case, so I'm afraid im going to have to bother you all with a post of my own. ...
0
by: ryjfgjl | last post by:
If we have dozens or hundreds of excel to import into the database, if we use the excel import function provided by database editors such as navicat, it will be extremely tedious and time-consuming...
0
by: ryjfgjl | last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
0
BarryA
by: BarryA | last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
by: Hystou | last post by:
There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...
0
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
0
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...
0
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...
0
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.