Skip to main content

Posts

Showing posts from 2014

What sex has to do with databases and the beginning of Data Science

Today I ran into a new Google search engine I did not know yet and which is very interesting. You can find it under the following URL:  Google ngrams Working in the field of data management and data mining I thought it would be interesting to see the activity in our sector and compare it to something everybody knows. "Football" was not a correct choice because there is also "soccer" and because it excludes half of the writing population.  So I chose "sex" which does not exclude populations, is widely known and written about.  Next to that I added the terms "database", "big data", "interactive analysis", "data analysis" and "performance indicators". All well known terms in our world and know to the mostly a masculine part of the population.  I did a comparison on both American English and British English. If we look at the search results (until 2008 unfortunately) we can see that after Woodstoc

Bug in Microsoft Management Data Warehouse (MDW).

For those who use MDW and complain about the bug in the report Disk Usage Collection Set - Database, I updated the query so that it shows now the correct data. The bug is very consistent: it's in there from 2008 R2 and still there. Take a look at the report: As you can see the report shows more than 8 GB but in the data below you can see that the database size is no more than 4,3 GB.  Someone just made a programming error at Microsoft and unknown on what the field values mean in MDW just mixed up some numbers. I have to admit that it is not easy :-) I rewrote the query after comparing the values with a real database and added the log value. Now it is to Microsoft to add it.  Here's the code (I highlighted the changes): exec sp_executesql @stmt = N'BEGIN TRY DECLARE @snapshot_id int SELECT TOP 1 @snapshot_id = snapshot_id FROM ( SELECT DISTINCT TOP 100 d.snapshot_id FROM snapshots.disk_usage d, core.snapshots ss WHERE ss.instance_name = @S