Cameron Fletcher

Random thoughts and dicussions on the things that interest me

Sentence and Word Analysis #1

I was not put forward for a job recently because I had Windows Forms experience and not WinForms experience stated on my CV. The same agency also said I had not been singled out as they were looking for someone who had worked with MVC, unlike me as I had only worked with the Model View Controller framework.

I have come to understand that I have to not only write my CV to appeal to prospective employers who should know what I’m talking about but also for the multiple layers of incompetent individuals though which my CV must make its journey before arriving before someone who can actually read. I have come to refer to these individuals as the jam layer because whilst I am the icing on the cake (and most of the time, the cake itself - that is, after all, what I get paid for) they are the jam in-between because they have jam for brains.

So, I decided to analyse the results from a jobserve.com search for .NET roles based in the UK and identify the top used keywords and phrases so that I could litter my CV with them in the hope that someone with jam for brains would identify the correlation, even if they don't understand what that means.

To achieve this completed the following steps:

  1. I wrote an RSS reader in C# that read my RSS feed (for .NET jobs based in UK) that I'd set up through jobserve.com.
  2. The RSS reader then iterated through each posting and called a stored procedure in my SQL database that added the content of the job posting to a sentences table as a string.
  3. I then had a stored procedure that split the string into words and added them to a words table along with details of the sentence they were in and their position within that sentence.
  4. The more complex bit was looping through the words in each sentence concatenating from one to ten consecutive words together throughout the sentence and placing the resultant string into an analysis table.
  5. I then performed a simple groupby query eliminating the conjunctives (and, or, as, etc.) to retrieve the results.

This took me a couple of hours and as jobserve.com only allows you to receive the last 24 hours worth of job postings via RSS I present you with the top 30 ranked words/phrases from 75 job postings (below). The rank column details the number of occurrences of that word within the 75 postings.

#  Word Rank   #  Word Rank   #  Word Rank
1  experience 114   11  skills 44   21  Server 36
2  C# 83   12  team 44   22  test 36
3  developer 75   13  experience of 43   23  ASP.NET 35
4  development 73   14  Risk 43   24  candidate 33
5  Strong 56   15  SQL 43   25  Web 33
6  knowledge 51   16  role 42   26  Applications 31
7  working 51   17  Business 39   27  work 29
8  Investment 49   18  client 39   28  SQL server 28
9  .NET 45   19  based 38   29  contract 27
10  trading 45   20  knowledge of 36   30  CV 27

Annoyingly, these results are probably more use to anyone wishing to write an appealing CV profile about themselves. For my purposes I probably need a larger dataset to work on – I reckon about 500-600 job postings and more granular analysis of wordsets ie. group words into technical, competency based, business area, etc. – and also to look into maybe the top 100 from specific wordsets rather than just the top 30 generic words.

Part two of this post may be found here.

Posted: May 30 2009, 11:59 by flet0496 | Comments (2) RSS comment feed |
  • Currently 0/5 Stars.
  • 1
  • 2
  • 3
  • 4
  • 5
Filed under: .NET

Comments

Comments are closed