What sex has to do with databases and the beginning of Data Science

Today I ran into a Google search engine I did not know yet, and it is a very interesting one. You can find it under the following URL: Google ngrams

Working in the field of data management and data mining, I thought it would be interesting to see the activity in our sector and compare it to something everybody knows. "Football" was not a good choice, because there is also "soccer" and because it excludes half of the writing population.

So I chose "sex", which excludes nobody and is widely known and written about.

Next to that I added the terms "database", "big data", "interactive analysis", "data analysis" and "performance indicators": all well-known terms in our world, and known mostly to the masculine part of the population.

I ran the comparison for both American English and British English.

If we look at the search results (which unfortunately run only until 2008), we can see that after Woodstock the interest in sex boomed in the USA and started to decline at the beginning of the 80s, only to revive slightly during the "cigar affair" of President Clinton and eventually to decline after the year 2000. No interest in sex any more in the USA.

Databases seem to have had their Woodstock at the beginning of the 80s and sharply decline after the year 2000. "Data analysis" comes up at the same time but has stayed pretty stable since. Nobody is interested in "big data" for now, and the same goes for "performance indicators" and "interactive analysis". Who cares.

If we look at the British English results, they are different:
The interest in "sex" started much earlier, already in the mid 60s, and grew again all the way to the "cigar affair" of President Clinton. It also shows a decline afterwards, but the interest is rising again after 2005.

The interest in "databases" seems to be correlated with "sex", but inversely: when there is more interest in sex there is less interest in databases, and vice versa. Maybe this is related to the "James Bond" image the British have, where spying always comes before or after the sex.

Next to that there seems to be an increasing interest in "data analysis" and "interactive analysis", much more than on the American English side. 

As in American English, there is no interest at all in "big data" on the British English side either.

I wrote this article to show you that, if you want to, you can find a correlation between anything and everything.
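To make that point a bit more concrete, here is a minimal sketch in Python. It uses purely synthetic data: none of the numbers come from the ngram results above, and the function names (`pearson`, `random_walk`, `share_of_strong_correlations`) are made up for this illustration. The idea is to generate pairs of completely independent "trends" (random walks) and count how often their Pearson correlation still looks strong. The threshold of 0.5 is an arbitrary assumption.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def random_walk(rng, steps):
    """Synthetic 'trend': cumulative sum of independent Gaussian steps."""
    level, walk = 0.0, []
    for _ in range(steps):
        level += rng.gauss(0.0, 1.0)
        walk.append(level)
    return walk


def share_of_strong_correlations(trials=200, steps=100, threshold=0.5, seed=42):
    """Fraction of independent random-walk pairs with |r| above the threshold."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    strong = 0
    for _ in range(trials):
        a = random_walk(rng, steps)
        b = random_walk(rng, steps)  # generated independently of a
        if abs(pearson(a, b)) > threshold:
            strong += 1
    return strong / trials


if __name__ == "__main__":
    share = share_of_strong_correlations()
    print(f"{share:.0%} of unrelated pairs look strongly correlated")
```

The two walks in each pair have nothing to do with each other, yet a sizeable share of them will come out looking "related". Trending series, like word frequencies over decades, are especially prone to this.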

The University of Washington started a new course called Data Science, and the goal of this course is to prevent this kind of false prediction and to professionalise the business.

Check it out on Course Data Science at the University of Washington

PS: Are you a data science analyst? Are these terms familiar?



