Hello group.
I have an issue, which has bothered me for a while now:
I'm wondering why the column statistics, which SQL Server wants me to
create, if I turn off auto-created statistics, are so important to the
optimizer?
Example: from Northwind (with auto create stats off), I do the following:
SELECT * FROM Customers WHERE Country = 'Sweden'
My query plan show a clustered index scan, which is expected - no index
exists for Country. BUT, the query plan also shows, that the optimizer is
missing a statistic on Country, which tells me, that the optimizer would
benefit from knowing this.
I cannot see why? (and I've been trying for a while now).
If I create the missing statistics, nothing happens in the query plan (and
why should it?). I could understand it, if the optimizer suggested an index
on Country - this would make sense, but if creating the missing index, query
analyzer creates the statistics with an empty index, which seems to me to be
less than usable.
I've been thinking long and hard about this, but haven't been able to reach
a conclusion :) It has some relevance to my work, because allowing the
optimizer to create missing statistics limits my options for designing
indexes (e.g. covering) for some rather wide tables, so I'm thinking why not
turn it off altogether. But I would like to know the consequences - hope
somebody has already delved into this, and knows a good explanation.
Rgds
Jesper 5 6635
"Jesper Jensen" <mo********@pos t.tdcadsl.dk> wrote in message
news:40******** *************@d read12.news.tel e.dk... Hello group.
I have an issue, which has bothered me for a while now:
I'm wondering why the column statistics, which SQL Server wants me to create, if I turn off auto-created statistics, are so important to the optimizer?
Example: from Northwind (with auto create stats off), I do the following:
SELECT * FROM Customers WHERE Country = 'Sweden'
My query plan show a clustered index scan, which is expected - no index exists for Country. BUT, the query plan also shows, that the optimizer is missing a statistic on Country, which tells me, that the optimizer would benefit from knowing this.
I cannot see why? (and I've been trying for a while now).
If I create the missing statistics, nothing happens in the query plan (and why should it?). I could understand it, if the optimizer suggested an
index on Country - this would make sense, but if creating the missing index,
query analyzer creates the statistics with an empty index, which seems to me to
be less than usable.
I've been thinking long and hard about this, but haven't been able to
reach a conclusion :) It has some relevance to my work, because allowing the optimizer to create missing statistics limits my options for designing indexes (e.g. covering) for some rather wide tables, so I'm thinking why
not turn it off altogether. But I would like to know the consequences - hope somebody has already delved into this, and knows a good explanation.
Rgds Jesper
http://msdn.microsoft.com/library/de.../statquery.asp
Simon
Thanks, Simon, informative article, but ...
.... it doesn't really explain the stuff, that I wrote. The closest I get to
an explanation, when reading this is 'These statistics are created for
columns where the optimizer would have to estimate the approximate density
or distribution otherwise'.
I knew this, but I still do not know, why the optimizer needs to know the
density and/or distribution?? I can see no valid reason, and therefore I can
see no good reason for enabling auto-creation of stats.
What I probably looking for is a good example, where the use of an
automatically created stat saves time, cycles and IOs :)
Best Rgds - Jesper
"Simon Hayes" <sq*@hayes.ch > skrev i en meddelelse
news:40******** **@news.bluewin .ch... "Jesper Jensen" <mo********@pos t.tdcadsl.dk> wrote in message news:40******** *************@d read12.news.tel e.dk... Hello group.
I have an issue, which has bothered me for a while now:
I'm wondering why the column statistics, which SQL Server wants me to create, if I turn off auto-created statistics, are so important to the optimizer?
Example: from Northwind (with auto create stats off), I do the
following: SELECT * FROM Customers WHERE Country = 'Sweden'
My query plan show a clustered index scan, which is expected - no index exists for Country. BUT, the query plan also shows, that the optimizer
is missing a statistic on Country, which tells me, that the optimizer would benefit from knowing this.
I cannot see why? (and I've been trying for a while now).
If I create the missing statistics, nothing happens in the query plan
(and why should it?). I could understand it, if the optimizer suggested an index on Country - this would make sense, but if creating the missing index, query analyzer creates the statistics with an empty index, which seems to me
to be less than usable.
I've been thinking long and hard about this, but haven't been able to reach a conclusion :) It has some relevance to my work, because allowing the optimizer to create missing statistics limits my options for designing indexes (e.g. covering) for some rather wide tables, so I'm thinking why not turn it off altogether. But I would like to know the consequences - hope somebody has already delved into this, and knows a good explanation.
Rgds Jesper
http://msdn.microsoft.com/library/de.../statquery.asp Simon
"Jesper Jensen" <mo********@pos t.tdcadsl.dk> wrote in message
news:40******** *************@d read12.news.tel e.dk... Thanks, Simon, informative article, but ...
... it doesn't really explain the stuff, that I wrote. The closest I get
to an explanation, when reading this is 'These statistics are created for columns where the optimizer would have to estimate the approximate density or distribution otherwise'.
I knew this, but I still do not know, why the optimizer needs to know the density and/or distribution?? I can see no valid reason, and therefore I
can see no good reason for enabling auto-creation of stats.
What I probably looking for is a good example, where the use of an automatically created stat saves time, cycles and IOs :)
Best Rgds - Jesper
OK, here's another informative article :-) http://www.winnetmag.com/SQLServer/A...075/22075.html
In summary, index statistics exist only for the first column in an index,
but auto-created (or manually created) statistics can exist for any column.
This gives the optimizer extra information, which might mean it chooses a
different, more efficient index for a query.
Check out the example on the second page of the article - on my system, this
reduced the logical reads required for the query from 104 to 43.
But you're correct to consider that there can be an impact on performance in
some situations: http://support.microsoft.com/default...b;en-us;195565
Simon
Thanks, Simon, that one did the trick.
One less mystery.
On my machine, QA tells me that the two queries (the index scan on
ProductID/Quantity vs. the clustered index scan) takes 43.65 and 56.35% cost
respectively. I would argue, that this saving is not worth the 'used up
index space'. In my professional life, I've seen tables, which are wide
enough (200+ columns) to demand, that precious index space is saved.
Basically, I think there are too many 'ifs' before an auto-created index
saves performance, but I appreciate the optimization idea behind it.
Thanks - Jesper
"Simon Hayes" <sq*@hayes.ch > skrev i en meddelelse
news:40******** @news.bluewin.c h... "Jesper Jensen" <mo********@pos t.tdcadsl.dk> wrote in message news:40******** *************@d read12.news.tel e.dk... Thanks, Simon, informative article, but ...
... it doesn't really explain the stuff, that I wrote. The closest I get to an explanation, when reading this is 'These statistics are created for columns where the optimizer would have to estimate the approximate
density or distribution otherwise'.
I knew this, but I still do not know, why the optimizer needs to know
the density and/or distribution?? I can see no valid reason, and therefore I can see no good reason for enabling auto-creation of stats.
What I probably looking for is a good example, where the use of an automatically created stat saves time, cycles and IOs :)
Best Rgds - Jesper
OK, here's another informative article :-)
http://www.winnetmag.com/SQLServer/A...075/22075.html
In summary, index statistics exist only for the first column in an index, but auto-created (or manually created) statistics can exist for any
column. This gives the optimizer extra information, which might mean it chooses a different, more efficient index for a query.
Check out the example on the second page of the article - on my system,
this reduced the logical reads required for the query from 104 to 43.
But you're correct to consider that there can be an impact on performance
in some situations:
http://support.microsoft.com/default...b;en-us;195565
Simon
"Jesper Jensen" <mo********@pos t.tdcadsl.dk> wrote in message news:<40******* **************@ dread12.news.te le.dk>... Thanks, Simon, that one did the trick.
One less mystery.
On my machine, QA tells me that the two queries (the index scan on ProductID/Quantity vs. the clustered index scan) takes 43.65 and 56.35% cost respectively. I would argue, that this saving is not worth the 'used up index space'. In my professional life, I've seen tables, which are wide enough (200+ columns) to demand, that precious index space is saved.
Basically, I think there are too many 'ifs' before an auto-created index saves performance, but I appreciate the optimization idea behind it.
Thanks - Jesper
<snip>
Well, you have to be careful about reaching conclusions based on
simple queries using small data sets. It's possible that a complex
join involving millions of rows would give a more significant
difference. To get a definite answer for your environment, you would
have to do some benchmarking, with and without statistics.
Simon This thread has been closed and replies have been disabled. Please start a new discussion. Similar topics |
by: Jerry Brenner |
last post by:
Our users have potentially dirty legacy data that they need to get
into our application. We provide a set of staging tables, which map
to our source tables, that the users do their ETL into. Every row in
the source tables has a generated integer id. Every row in both the
source and staging tables has a unique publicid (varchar(22)). All
foreign key references in the staging tables are through publicids.
(The foreign key reference could...
|
by: Harold |
last post by:
Hi,
I have a question in SQL Server 2K, I use SQL Profile to trace, and
find Stored Procedure was auto recompiled, like this row in the
trace:
SP:Recompile 15 1680 76 2004-02-27 16:01:11.610
How can I stop the auto recompile.
Thanks
Harold
|
by: williamc |
last post by:
Somebody asked me if it would be possible to add auto-advance to a web
form where there are a lot of repetitive 5 character fields. I took a
look around the web and found a script, which appears to work in the
couple of browsers I tried it in. However, when I look at the script it
appears to have the wrong number of brackets. Then when I changed the
script to the way I thought it should be, it still worked!
I'm totally rusty on...
|
by: Robin Tucker |
last post by:
Hi there,
I have this really frustrating problem with Visual Studio (Microsoft Visual
Basic .NET 2003 69586-335-0000007-18843). I have got used to using the
autocomplete and auto indent features of this excellent source editor but
recently it has only operated on some of my source files and not others. I
have made sure the settings in "tools->options->text editor" are set
correctly but to no avail. For example, hitting enter at the...
|
by: Beowulf |
last post by:
I was having this problem:
http://groups.google.com/group/microsoft.public.sqlserver.server/msg/e36e423972323378?dmode=source
with it taking an inordinate amount of time to enumerate the databases
on my local SQL Server instance, so I went through all the local
databases and unchecked "Auto Close" on the "Options" tab of the
Database properties in Enterprise Manager.
This cleared up the problem nicely. In EM or through SQL-DMO, the...
| |
by: zimes |
last post by:
I am working with Microsoft Access 2003 from the Office XP pro suite. I have created several tables that have to do with Asset tracking along with purchasing. I have a table called Purchasing, Assets, Comapnay, AssetCategorty, and AssetNumber. The number I would like to create would take data from Purchasing and Assets, then create a number that would be in this format:
YY| company 3 letter abrivation | Asset Type | 3 digit auto generated...
|
by: Iluvatar |
last post by:
Hi,
last week our project upgraded from vs.net 2005 to vs.net 2008. Our
project still uses framework 2.0, because the deployment servers are
not 3.x ready yet. Everything so far while migrating the solution and
project files worked fine.
Today, I tried to add new controls to an existing web form. The markup
doesn't show any errors, but in the auto generated code behind
(*.aspx.designer.cs) I am missing the declartion of the added...
|
by: Bobby Edward |
last post by:
I am using VS 2008 SP1, ASP.NET 3.5, VB.NET, MySQL (latest version).
I created a new dataset using VS. In the ds I have a users table. I let it
create the "Fill, GetData" functions in the usersTableAdapter.
When I call the INSERT command, how do I get it to return the ID (the PK) of
the newly created record?
I pulled up the properties of the TableAdapter, then went to the
InsertCommand/Parameters. I added a parameter (for the pk I...
|
by: JOYCE |
last post by:
Look the subject,that's my problem!
I hope someone can help me, thanks
|
by: marktang |
last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However, people are often confused as to whether an ONU can Work As a Router. In this blog post, we’ll explore What is ONU, What Is Router, ONU & Router’s main usage, and What is the difference between ONU and Router. Let’s take a closer look !
Part I. Meaning of...
|
by: Hystou |
last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can effortlessly switch the default language on Windows 10 without reinstalling. I'll walk you through it.
First, let's disable language synchronization. With a Microsoft account, language settings sync across devices. To prevent any complications,...
| |
by: jinu1996 |
last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven tapestry of website design and digital marketing. It's not merely about having a website; it's about crafting an immersive digital experience that captivates audiences and drives business growth.
The Art of Business Website Design
Your website is...
|
by: Hystou |
last post by:
Overview:
Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows Update option using the Control Panel or Settings app; it automatically checks for updates and installs any it finds, whether you like it or not. For most users, this new feature is actually very convenient. If you want to control the update process,...
|
by: tracyyun |
last post by:
Dear forum friends,
With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each protocol has its own unique characteristics and advantages, but as a user who is planning to build a smart home system, I am a bit confused by the choice of these technologies. I'm particularly interested in Zigbee because I've heard it does some...
|
by: isladogs |
last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM).
In this session, we are pleased to welcome a new presenter, Adolph Dupré who will be discussing some powerful techniques for using class modules.
He will explain when you may want to use classes instead of User Defined Types (UDT). For example, to manage the data in unbound forms.
Adolph will...
|
by: adsilva |
last post by:
A Windows Forms form does not have the event Unload, like VB6. What one acts like?
|
by: 6302768590 |
last post by:
Hai team
i want code for transfer the data from one system to another through IP address by using C# our system has to for every 5mins then we have to update the data what the data is updated we have to send another system
| |
by: muto222 |
last post by:
How can i add a mobile payment intergratation into php mysql website.
| |