I have a table that contains transactional data. Such as site view by
whom, when, which template, etc, etc...
Everytime when I pulled the report, hh:mm:ss never matters. Only
breakdown by dates, not time.
Having read some chapters of Ralph Kimball's book, I am inspired to
build "date" table with integer as primary key.
Here's what I have for schema of transactional table.
- viewed_customer_id int (4bytes)
- template_id uniqidentifier (16 bytes)
- viewed_on datetime (8 bytes)
And here's the version I am thinking of building
- viewed_customer_id int (4bytes)
- template_id uniqidentifier (16 bytes)
- viewed_date_key int (4 bytes)
- seconds int (4 bytes)
* I put seconds just in case I need to retrieve hour based or minute
based report.
Here's my question. I've also noticed that smalldatetime is also 4bytes
of memory but it consists of 2 sets of 2 bytes.
When I index, would there be significant performance difference between
indexing 4 byte of column and indexing 2bytes x 2 of column? 7 4593
Bostonasian (ax****@gmail.com) writes: I have a table that contains transactional data. Such as site view by whom, when, which template, etc, etc... Everytime when I pulled the report, hh:mm:ss never matters. Only breakdown by dates, not time. Having read some chapters of Ralph Kimball's book, I am inspired to build "date" table with integer as primary key.
Here's what I have for schema of transactional table.
- viewed_customer_id int (4bytes) - template_id uniqidentifier (16 bytes) - viewed_on datetime (8 bytes)
And here's the version I am thinking of building
- viewed_customer_id int (4bytes) - template_id uniqidentifier (16 bytes) - viewed_date_key int (4 bytes) - seconds int (4 bytes)
The problem with using int for dates is that you might run into confusion
what your zero date is. In SQL Server the base date is 1900-01-01. In
Visual Basic etc it's 1899-12-30. In Unix it's 1970-01-01.
While datetime is not ideal for dates-only data, it's fairly easy
to use. In our system we actually have a type for it:
EXEC sp_addtype 'aba_date', 'datetime'
go
CREATE RULE aba_date_rule AS convert(char(8), @x, 112) = @x
go
EXEC sp_bindrule 'aba_date_rule', 'aba_date'
go
So it's not even possible to sneak in any seconds there. To strip
Hours and seconds from a value, this is the deal:
convert(char(8), date_value, 112)
(It's important to use format 112, as the resulting string will
always be converted back to datetime correctly.)
Here's my question. I've also noticed that smalldatetime is also 4bytes of memory but it consists of 2 sets of 2 bytes.
When I index, would there be significant performance difference between indexing 4 byte of column and indexing 2bytes x 2 of column?
4 bytes is 4 bytes. While smalldatetime may be described as 2+2, it's
nevertheless just another 32 bits when it comes to indexing.
As for whether smalldatetime is an option to datetime, it depends on
your business requirements. If all you use it for is tracking events,
then it might do. We have abandoned smalldatetime in our system, save
for some auditing columns, because we have encountered real-world
data beyond 2076.
--
Erland Sommarskog, SQL Server MVP, es****@sommarskog.se
Books Online for SQL Server SP3 at http://www.microsoft.com/sql/techinf...2000/books.asp
Thanks for the advice.
I got that "date" table from Ralph Kimball's "Data Warehouse tookit",
from "Retail" chapter.
I guess in the retail. It's crucial to know sales date was weekday or
weekend? Was is holiday? Which fiscal month/year it was, etc, etc...
Beyond reporting by only dates, there are occasions where I need to
come up with analysis(though it was not often).
Thanks for the tips.
On 30 Mar 2005 07:54:35 -0800, Bostonasian wrote: Thanks for the advice. I got that "date" table from Ralph Kimball's "Data Warehouse tookit", from "Retail" chapter. I guess in the retail. It's crucial to know sales date was weekday or weekend? Was is holiday? Which fiscal month/year it was, etc, etc... Beyond reporting by only dates, there are occasions where I need to come up with analysis(though it was not often).
Thanks for the tips.
Hi Bostonasian,
Having a table with all dates and various properties of each date (such
as weekend, public holiday, reporting period, etc.) is good. Using an
integer to represent the data is not.
To save myself the typing, I'll just refer you to www.aspfaq.com/2519,
where it is all explained in full detail.
Best, Hugo
--
(Remove _NO_ and _SPAM_ to get my e-mail address)
"Hugo Kornelis" <hugo@pe_NO_rFact.in_SPAM_fo> wrote in message
news:jh********************************@4ax.com... On 30 Mar 2005 07:54:35 -0800, Bostonasian wrote:
Thanks for the advice. I got that "date" table from Ralph Kimball's "Data Warehouse tookit", from "Retail" chapter. I guess in the retail. It's crucial to know sales date was weekday or weekend? Was is holiday? Which fiscal month/year it was, etc, etc... Beyond reporting by only dates, there are occasions where I need to come up with analysis(though it was not often).
Thanks for the tips.
Hi Bostonasian,
Having a table with all dates and various properties of each date (such as weekend, public holiday, reporting period, etc.) is good. Using an integer to represent the data is not.
To save myself the typing, I'll just refer you to www.aspfaq.com/2519, where it is all explained in full detail.
Best, Hugo --
(Remove _NO_ and _SPAM_ to get my e-mail address)
While I agree that datetypes should be as close to what they represent,
in the book he refers to the fact table with the detailed data, has an
integer as a key ID to a seperate date table.
The actual datetime value, and all the necesary flags and groupings ( fiscal
quarter, date of week, holiday, etc..) is stored in that side car or
outrigger table.
Since current production versions of SQL don't have a seperate date
datatype, I tend to use smalldatetime instead.
With the side car table, a text representation can also be done to make
reporting easier
i.e. FullDateName column could be "Wednesday, February 31st 2007" , but
sorting is done by actual datetime datatype column.
On Thu, 31 Mar 2005 04:34:02 GMT, David Rawheiser wrote:
(snip quoteback) While I agree that datetypes should be as close to what they represent, in the book he refers to the fact table with the detailed data, has an integer as a key ID to a seperate date table.
The actual datetime value, and all the necesary flags and groupings ( fiscal quarter, date of week, holiday, etc..) is stored in that side car or outrigger table.
(snip)
Hi David,
I don't have this book, but what I'd like to know is what advantage the
integer ID key of this side car table has over a similar side table with
a datetime (or smalldatetime) key, with a CHECK constraint to ensure
that only the default time portion (midnight) is allowed.
Since current production versions of SQL don't have a seperate date datatype, I tend to use smalldatetime instead.
As long as you're aware that smalldatetime has a time portion as well
(though with less precision) and that dates after June 6, 2079 can't be
represented, this is fine.
With the side car table, a text representation can also be done to make reporting easier i.e. FullDateName column could be "Wednesday, February 31st 2007" , but sorting is done by actual datetime datatype column.
I agree - but that can equally well be done if the side car table uses a
[small]datetime column os primary key.
Best, Hugo
--
(Remove _NO_ and _SPAM_ to get my e-mail address)
"Hugo Kornelis" <hugo@pe_NO_rFact.in_SPAM_fo> wrote in message
news:je********************************@4ax.com... On Thu, 31 Mar 2005 04:34:02 GMT, David Rawheiser wrote:
(snip quoteback)While I agree that datetypes should be as close to what they represent, in the book he refers to the fact table with the detailed data, has an integer as a key ID to a seperate date table.
The actual datetime value, and all the necesary flags and groupings ( fiscal quarter, date of week, holiday, etc..) is stored in that side car or outrigger table. (snip)
Hi David,
I don't have this book, but what I'd like to know is what advantage the integer ID key of this side car table has over a similar side table with a datetime (or smalldatetime) key, with a CHECK constraint to ensure that only the default time portion (midnight) is allowed.
Since current production versions of SQL don't have a seperate date datatype, I tend to use smalldatetime instead.
As long as you're aware that smalldatetime has a time portion as well (though with less precision) and that dates after June 6, 2079 can't be represented, this is fine.
With the side car table, a text representation can also be done to make reporting easier i.e. FullDateName column could be "Wednesday, February 31st 2007" , but sorting is done by actual datetime datatype column.
I agree - but that can equally well be done if the side car table uses a [small]datetime column os primary key.
Best, Hugo --
(Remove _NO_ and _SPAM_ to get my e-mail address)
I depends on how you are using the date.
If you don't need to present the items by grouping by month, sales period,
fiscal quarters, day of week, etc.
or if you don't want to include weekends or holidays or full moons, etc...
in your reports,
you either need to:
a) put a boat load of complex code in your procedures or functions to
determine these things.
b) create a date calendar table, add columns and set the flags on each of
these atributes.
Thus the resulting query is a simple join from the fact table to this
calendar table and include/exclude in the where clause or groupby those you
need.
You can use the date datatype itself as the keys, and get the same results
but in the book it tends to use abstract datatypes for the fact table
attributes. http://www.amazon.com/exec/obidos/tg...&s=ebooks&st=*
On Fri, 01 Apr 2005 04:30:36 GMT, David Rawheiser wrote:
(snip) I depends on how you are using the date.
If you don't need to present the items by grouping by month, sales period, fiscal quarters, day of week, etc. or if you don't want to include weekends or holidays or full moons, etc... in your reports, you either need to: a) put a boat load of complex code in your procedures or functions to determine these things. b) create a date calendar table, add columns and set the flags on each of these atributes.
Hi David,
I fully agree - and in most cases, the calendar table is the better
option.
Thus the resulting query is a simple join from the fact table to this calendar table and include/exclude in the where clause or groupby those you need.
Yep.
You can use the date datatype itself as the keys, and get the same results but in the book it tends to use abstract datatypes for the fact table attributes.
http://www.amazon.com/exec/obidos/tg...&s=ebooks&st=*
And this is where I disagrgee with the advise given in the book. Sure,
there are situations where a surrogate key is better than a natural key,
but in this specific case, I fail to see any advantages.
Like I said - I don't have this book (and I don't intend to buy it
either :-P) Do the authors give any reasons for their choice to use an
artificial key here? If so, I'd be interested to hear it!
Best, Hugo
--
(Remove _NO_ and _SPAM_ to get my e-mail address) This thread has been closed and replies have been disabled. Please start a new discussion. Similar topics
by: Uli |
last post by:
Dear group,
I have to replicate remote data to a SQL Server in the headquarter.
The data are on a site which does not have permanent online-connection
to the headquarter.
I have written a...
|
by: Bradley.. .. . . .. .. |
last post by:
Requirement: managers can login and only see employee data for the
department they manage and any sub-ordinate departments.
The departments are in a table that defines a simple tree structure...
|
by: Kukurydz |
last post by:
I've got such problem:
My database is stored on MSSQL Server. I have to write reports for it in
MSAccess. I've got a problem with creating a query which will select records
from MSSQL table with...
|
by: Rookie Card |
last post by:
Issue - Reformating the Dates in ASP.NET from a MSSQL Database that has
<NULL> values in a SmallDateTime Field
to read "Dec 8, 2000" instead of "12/8/2000 12:00:00 AM"
(As you might already...
|
by: jblankenburg |
last post by:
Please help!
I am hunting high and low for an equivalent function for MSSQL's
DATENAME function. Here's the spec on the function from MSSQL's Books
Online:
DATENAME
Returns a character...
|
by: wildfyre53207 |
last post by:
Here is our problem...
We are doing a lot of selects against a table that has one large field
in it.
If we do a select against all the fields except for description, the
query comes back...
|
by: c3q8 |
last post by:
I have a batch system where the user enter suppliers invoices to a batch table (SupVchBat)
then selectively update certain invoice to outstanding invoice table (SupVch) and updated
suppliers...
|
by: Igal |
last post by:
I'm trying to insert a date value into MSSQL, the type of the sql filed
is: "smalldatetime"
and i'm trying to insert a text Variable that looks like this:
"19/02/2006".
.... SET update_date='"...
|
by: Normann |
last post by:
I am creating a Stored Proc and I need to be able to select the last added row, now this should be made easier by the fact that I have a smalldatetime column in the table that is added every time a...
|
by: antdb |
last post by:
Ⅰ. Advantage of AntDB: hyper-convergence + streaming processing engine
In the overall architecture, a new "hyper-convergence" concept was proposed, which integrated multiple engines and...
|
by: AndyPSV |
last post by:
HOW CAN I CREATE AN AI with an .executable file that would suck all files in the folder and on my computerHOW CAN I CREATE AN AI with an .executable file that would suck all files in the folder and...
|
by: Arjunsri |
last post by:
I have a Redshift database that I need to use as an import data source. I have configured the DSN connection using the server, port, database, and credentials and received a successful connection...
|
by: WisdomUfot |
last post by:
It's an interesting question you've got about how Gmail hides the HTTP referrer when a link in an email is clicked. While I don't have the specific technical details, Gmail likely implements measures...
|
by: Oralloy |
last post by:
Hello Folks,
I am trying to hook up a CPU which I designed using SystemC to I/O pins on an FPGA.
My problem (spelled failure) is with the synthesis of my design into a bitstream, not the C++...
|
by: BLUEPANDA |
last post by:
At BluePanda Dev, we're passionate about building high-quality software and sharing our knowledge with the community. That's why we've created a SaaS starter kit that's not only easy to use but also...
|
by: Rahul1995seven |
last post by:
Introduction:
In the realm of programming languages, Python has emerged as a powerhouse. With its simplicity, versatility, and robustness, Python has gained popularity among beginners and experts...
|
by: Ricardo de Mila |
last post by:
Dear people, good afternoon...
I have a form in msAccess with lots of controls and a specific routine must be triggered if the mouse_down event happens in any control.
Than I need to discover what...
|
by: ezappsrUS |
last post by:
Hi,
I wonder if someone knows where I am going wrong below. I have a continuous form and two labels where only one would be visible depending on the checkbox being checked or not. Below is the...
| |