
What sex has to do with databases and the beginning of Data Science

Today I ran into a Google search engine I did not know yet, and it is very interesting. You can find it here: Google Ngrams

Working in the field of data management and data mining, I thought it would be interesting to see the activity in our sector and compare it to something everybody knows. "Football" was not a good choice because there is also "soccer" and because it excludes half of the writing population.

So I chose "sex", which excludes no part of the population and is widely known and written about.

Next to that I added the terms "database", "big data", "interactive analysis", "data analysis" and "performance indicators". These are all well-known terms in our world, known mostly to the masculine part of the population.

I did a comparison on both American English and British English.

If we look at the search results (which unfortunately only run until 2008), we can see that after Woodstock the interest in sex experienced a boom in the USA and started to decline at the beginning of the 1980s, only to revive slightly during the "cigar affair" of President Clinton and eventually to decline after the year 2000. No interest any more in sex in the USA.

Databases seem to have had their Woodstock at the beginning of the 1980s and declined sharply after the year 2000. "Data analysis" comes up at the same time but has stayed pretty stable since. Nobody is interested in "big data" for now, and the same goes for "performance indicators" and "interactive analysis". Who cares.

If we look at the British English results, they are different:
The interest in "sex" started way earlier, already in the mid 1960s, and kept growing all the way to the "cigar affair" of President Clinton. It also shows a decline afterwards, but the interest is rising again after 2005.

The interest in "databases" seems to be inversely correlated with "sex": when there is more interest in sex there is less interest in databases, and vice versa. Maybe this is related to the "James Bond" image the British have, where spying always comes before or after the sex.

Next to that there seems to be an increasing interest in "data analysis" and "interactive analysis", much more than on the American English side. 

As with American English, there is no interest at all in "big data" on the British English side.

I wrote this article to show you that if you want you can find a correlation between anything and everything. 
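The point can be illustrated with a small simulation. This is a minimal sketch, not the analysis behind the Ngram charts: it assumes that a slowly drifting series (such as word frequency over decades) behaves roughly like a random walk, and every function name, the seed, the series length and the 0.5 threshold are choices made here purely for illustration. It generates pairs of series that are independent by construction and counts how often they still look strongly correlated.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def random_walk(rng, length):
    """Cumulative sum of independent Gaussian steps: a series that drifts."""
    level = 0.0
    walk = []
    for _ in range(length):
        level += rng.gauss(0.0, 1.0)
        walk.append(level)
    return walk


def white_noise(rng, length):
    """Independent Gaussian values with no drift, for comparison."""
    return [rng.gauss(0.0, 1.0) for _ in range(length)]


def share_strongly_correlated(make_series, trials=1000, length=100,
                              threshold=0.5, seed=42):
    """Fraction of independent series pairs with |r| above the threshold."""
    rng = random.Random(seed)  # fixed seed (arbitrary) so runs are repeatable
    hits = 0
    for _ in range(trials):
        a = make_series(rng, length)
        b = make_series(rng, length)  # generated independently of a
        if abs(pearson(a, b)) > threshold:
            hits += 1
    return hits / trials


walk_share = share_strongly_correlated(random_walk)
noise_share = share_strongly_correlated(white_noise)
print(f"drifting series with |r| > 0.5: {walk_share:.1%}")
print(f"pure noise with |r| > 0.5:      {noise_share:.1%}")
```

Under these assumptions, a sizeable share of the unrelated drifting pairs crosses the threshold while the pure-noise pairs almost never do. Two trends that both wander over time will often "correlate" even though neither has anything to do with the other.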

The University of Washington started a new course called Data Science, and the goal of this course is to prevent this kind of false prediction and to professionalise the business.

Check it out: the Data Science course at the University of Washington

PS: Are you a data science analyst? Are these terms familiar?



