<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Boolean Black Belt &#187; Semantic Search</title>
	<atom:link href="http://www.booleanblackbelt.com/category/semantic-search/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.booleanblackbelt.com</link>
	<description>Leveraging social networks, resume databases, and the Internet for sourcing and recruiting</description>
	<lastBuildDate>Tue, 27 Jul 2010 14:00:08 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.3</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>SourceCon 2010: Resume Sourcing and Matching &#8211; AI vs. Humans</title>
		<link>http://www.booleanblackbelt.com/2010/03/sourcecon-2010-resume-sourcing-and-matching-ai-vs-humans/</link>
		<comments>http://www.booleanblackbelt.com/2010/03/sourcecon-2010-resume-sourcing-and-matching-ai-vs-humans/#comments</comments>
		<pubDate>Tue, 16 Mar 2010 18:15:34 +0000</pubDate>
		<dc:creator>Boolean Black Belt</dc:creator>
				<category><![CDATA[Artificial Intelligence Matching]]></category>
		<category><![CDATA[Boolean]]></category>
		<category><![CDATA[Extended Boolean]]></category>
		<category><![CDATA[Hidden Talent Pools]]></category>
		<category><![CDATA[Human Capital Data]]></category>
		<category><![CDATA[Proximity Searching]]></category>
		<category><![CDATA[Recruiting Technology]]></category>
		<category><![CDATA[Search Process]]></category>
		<category><![CDATA[Semantic Search]]></category>
		<category><![CDATA[SourceCon]]></category>
		<category><![CDATA[Sourcing Automation]]></category>
		<category><![CDATA[Talent Mining]]></category>
		<category><![CDATA[2010]]></category>
		<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Glen Cathey]]></category>
		<category><![CDATA[Intelligent Search and Match]]></category>
		<category><![CDATA[Keynote]]></category>
		<category><![CDATA[Recruiting]]></category>
		<category><![CDATA[Resume Matching]]></category>
		<category><![CDATA[Resume Sourcing]]></category>
		<category><![CDATA[Sourcing]]></category>
		<category><![CDATA[SoureceCon]]></category>

		<guid isPermaLink="false">http://www.booleanblackbelt.com/?p=5093</guid>
		<description><![CDATA[Here is the expanded slide deck from my SourceCon 2010 Keynote: Resume Sourcing and Matching &#8211; Artificial Intelligence vs. Human Cognition. It contains all of the talking points as text so you are not left guessing as to what I spoke to during the live presentation.  
You&#8217;ll learn about the intrinsic and often overlooked challenges [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2010%2F03%2Fsourcecon-2010-resume-sourcing-and-matching-ai-vs-humans%2F"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2010%2F03%2Fsourcecon-2010-resume-sourcing-and-matching-ai-vs-humans%2F" height="61" width="51" /></a></div><p>Here is the expanded slide deck from my SourceCon 2010 Keynote: Resume Sourcing and Matching &#8211; Artificial Intelligence vs. Human Cognition. It contains all of the talking points as text so you are not left guessing as to what I spoke to during the live presentation. <img src='http://www.booleanblackbelt.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<p>You&#8217;ll learn about the intrinsic and often overlooked challenges associated with sourcing resumes (it&#8217;s deceptively complex), what artificially intelligent semantic search and match applications claim to do and how they actually work, the limits of artificial intelligence, what people can do that semantic search applications cannot, the 5 levels of semantic search,  the 5 levels of talent mining, and what I think is the ideal candidate sourcing solution.</p>
<div id="__ss_3447353" style="width: 425px;"><strong style="display:block;margin:12px 0 4px"><a title="SourceCon 2010: Resume Sourcing and Matching: Artificial Intelligence vs. Human Cognition" href="http://www.slideshare.net/glencathey/sourcecon-2010-resume-sourcing-and-matching-artificial-intelligence-vs-human-cognition-3447353">SourceCon 2010: Resume Sourcing and Matching: Artificial Intelligence vs. Human Cognition</a></strong><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="425" height="355" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="src" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=sourceconpresentationfullv5forslideshare-100316124352-phpapp01&amp;rel=0&amp;stripped_title=sourcecon-2010-resume-sourcing-and-matching-artificial-intelligence-vs-human-cognition-3447353" /><param name="allowfullscreen" value="true" /><embed type="application/x-shockwave-flash" width="425" height="355" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=sourceconpresentationfullv5forslideshare-100316124352-phpapp01&amp;rel=0&amp;stripped_title=sourcecon-2010-resume-sourcing-and-matching-artificial-intelligence-vs-human-cognition-3447353" allowscriptaccess="always" allowfullscreen="true"></embed></object></div>
<div style="padding:5px 0 12px">View more <a href="http://www.slideshare.net/">presentations</a> from <a href="http://www.slideshare.net/glencathey">Glen Cathey</a>.</div>
<div style="padding:5px 0 12px">Additionally, you can view the video from the SourceCon event <a class="wp-caption-dd" title="Video of SourceCon 2010 Keynote: Resume Sourcing and Matching - Artificial Intelligence vs. Human Cognition" href="http://www.sourcecon.com/2010/session-descriptions/#session-85" target="_self">here</a>.</div>
]]></content:encoded>
			<wfw:commentRss>http://www.booleanblackbelt.com/2010/03/sourcecon-2010-resume-sourcing-and-matching-ai-vs-humans/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Automated Semantic Search: Proceed with Caution</title>
		<link>http://www.booleanblackbelt.com/2009/07/candidate-search-automation-proceed-with-caution/</link>
		<comments>http://www.booleanblackbelt.com/2009/07/candidate-search-automation-proceed-with-caution/#comments</comments>
		<pubDate>Tue, 21 Jul 2009 17:37:28 +0000</pubDate>
		<dc:creator>Boolean Black Belt</dc:creator>
				<category><![CDATA[Artificial Intelligence Matching]]></category>
		<category><![CDATA[Semantic Search]]></category>
		<category><![CDATA[Sourcing Automation]]></category>
		<category><![CDATA[Automated Candidate Sourcing]]></category>
		<category><![CDATA[Automated Semantic Search]]></category>
		<category><![CDATA[Candidate Matching Applications]]></category>
		<category><![CDATA[Candidate Sourcing]]></category>

		<guid isPermaLink="false">http://www.booleanblackbelt.com/?p=3080</guid>
		<description><![CDATA[There are many software vendors who offer solutions to HR, recruiting, and sourcing organizations that claim to have automated candidate search and match capability. These applications can take your search, a job description, or an example resume and claim to leverage semantic search, fuzzy logic and/or Artificial Intelligence search technology to return relevant results. 
I&#8217;ve had the opportunity to use [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2009%2F07%2Fcandidate-search-automation-proceed-with-caution%2F"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2009%2F07%2Fcandidate-search-automation-proceed-with-caution%2F" height="61" width="51" /></a></div><p style="text-align: left;"><a href="http://www.booleanblackbelt.com/wp-content/uploads/2009/07/artificial-intelligence-by-thealieness-giselagiardinoc2b2c2b3.jpg"><img class="alignright size-full wp-image-3437" title="artificial-intelligence-by-thealieness-giselagiardinoc2b2c2b3" src="http://www.booleanblackbelt.com/wp-content/uploads/2009/07/artificial-intelligence-by-thealieness-giselagiardinoc2b2c2b3.jpg" alt="" width="191" height="192" /></a>There are many software vendors who offer solutions to HR, recruiting, and sourcing organizations that claim to have automated candidate search and match capability. These applications can take your search, a job description, or an example resume and claim to leverage semantic search, fuzzy logic and/or Artificial Intelligence search technology to return relevant results. </p>
<p style="text-align: left;">I&#8217;ve had the opportunity to use and evaluate 4 &#8220;big name&#8221; semantic and intelligent search/match applications for identifying candidates, and I am currently implementing one of them, including cutomizing the <a class="wp-caption-dd" title="Here's a good SlideShare presentation about implementing semantic search that explains taxonomies and ontologies" href="http://www.slideshare.net/pwlodar1/implementing-semantic-search" target="_self">taxonomies and ontologies</a> - so I have quite a bit of hands-on, practical experience with semantic search applications. While I do think that intelligent semantic search and match applications definitely have a place in the sourcing and recruiting process, they should not be looked at as solutions to a problem, challenge, or a deficiency in your skills or your team&#8217;s.</p>
<p style="text-align: left;">Before implementing a semantic search application, it is important to first understand the manual search process, how automated semantic search works, and the intrinsic limitations of of semantic search applications. </p>
<h3>People Do the Work, Computers Move Information</h3>
<p>Eiji Toyoda, the fifth president of Toyota Motor Corporation, who collaborated Taiichi Ohno to fine tune the concept of <a class="wp-caption-dd" title="Learn more about Kaizen" href="http://en.wikipedia.org/wiki/Kaizen" target="_blank">Kaizen</a> as well as to develop the core concepts of the <a class="wp-caption-dd" title="Learn more about The Toyota Way" href="http://en.wikipedia.org/wiki/The_Toyota_Way" target="_blank">&#8216;Toyota Way&#8217;</a>, explains brilliantly: &#8221;Society has reached the point where one can push a button and be immediately deluged with&#8230;information. This is all very convenient, of course, but if one is not careful there is a danger of losing the ability to think. We must remember that in the end it is the individual human being who must solve the problems.&#8221; </p>
<p>As I have written before, <a class="wp-caption-dd" title="Think before you search!" href="http://www.booleanblackbelt.com/2009/07/the-cardinal-rule-of-e-sourcing/" target="_blank">thinking is the most critical step in the candidate sourcing process</a>, and regardless of &#8220;Artificial Intelligence&#8221; and semantic search marketing hype, applications do not have any true cognitive power, nor do they have the ability to be creative or learn as people do. Thus I could not agree more with Jeffrey Liker&#8217;s (author of <a class="wp-caption-dd" title="Excellent read - I found it quite easy to apply many of the principles to recruiting" href="http://www.amazon.com/Toyota-Way-Jeffrey-Liker/dp/0071392319" target="_blank">The Toyota Way</a>) assessment - &#8221;People do the work, computers move information.&#8221;<span id="more-3080"></span></p>
<h3>Do Not Automate What You Cannot Already Perform Manually</h3>
<p>If you are not a highly proficient Talent Miner, capable of manually querying databases and systems with Boolean logic and a high degree of precision to find not only the obvious and easy to find candidates, but also the those residing in <a class="wp-caption-dd" title="Learn about Hidden Talent Pools of candidates in every database and social network" href="http://www.booleanblackbelt.com/2009/03/how-to-find-candidates-others-dont-and-cant/" target="_blank">Hidden Talent Pools</a> &#8211; how can you even HOPE to begin to evaluate what an automated semantic search solution is claiming to do?</p>
<p>You can&#8217;t.</p>
<p>If you&#8217;re going to buy and use product or service of ANY kind, and you don&#8217;t really understand what it does or even exactly how it does it (beyond the marketing hype), and you can&#8217;t tell if it REALLY does everything it claims to &#8211; how and at what level can you determine if the product or service truly meets your needs and will provide true value?  </p>
<p>If you or your organization struggles with the challenge of finding the right candidates at the right time on a consistent basis, implementing an automated search and match application will not magically solve this &#8220;problem&#8221; for you. The real underlying problem is likely that you or your organization currently does not #1 possess the right skills or the right people who are highly proficient at candidate sourcing and/or #2 have documented highly effective candidate sourcing processes and best practices that are consistently trained to and followed. No software application can fix either or both of those issues.</p>
<p>Jane Beseda, Group VP at Toyota, believes that it is best to &#8220;First work out the manual process, and then automate it. Try to build into the system as much flexibility as you possibly can so you can continue to kaizen the process as your business changes&#8221; because &#8220;&#8230;you can kaizen (continually improve) people processes very easily, but it is hard to kaizen a machine (or application).&#8221; I agree wholeheartedly.</p>
<h3>Master the Sourcing Process Manually, THEN Introduce Automation</h3>
<p>If you or the people on your team don&#8217;t fully understand how to effectively leveraging technology for talent identification and you can&#8217;t perform it manually, I do not recommend implementing an automated search/match solution. It is critical to #1 First develop your skills and ability (or your team&#8217;s) to manually source candidates, #2 Document your sourcing best practices and processes, #3 Make sure that they are consistently trained and applied, and #4 Strive to continually improve them. THEN you can go about evaluating automated search and match solutions because you will actually have the ability to truly assess and understand what the products do, you will be able to determine whether or not they meet your needs and can provide real value to your organization, and you will be able to assess how you can possibly best leverage them into your sourcing efforts.</p>
<p>For example, if you don&#8217;t really comprehend the concept of semantic search and how it can be applied in candidate sourcing (e.g., you can manually leverage semantic search in your Boolean strings at an expert level or you have someone on your team that can), you won&#8217;t be able to tell whether or not a application claiming to leverage semantic search is really effectively doing so, if it will provide any real value to your sourcing efforts, or if it is finding all of the best possible matches in the sources it is searching.</p>
<h3>Speaking of Semantic Search</h3>
<p>Many automated candidate search and match solutions claim to leverage semantic search. <a class="wp-caption-dd" title="Explore these links to a variety of articles on semantic search for sourcing and recruiting" href="http://www.booleanblackbelt.com/category/semantic-search/" target="_blank">I&#8217;ve written many articles on the topic of how semantics can be leveraged for sourcing candidates</a> &#8211; so I won&#8217;t go into too much detail here &#8211; but semantics refers to the study of meaning, as inherent at the levels of words, phrases, and sentences.</p>
<p>Sourcers and recruiters can leverage semantics in their sourcing efforts to more quickly find more relevant results. When the search results match the intended MEANING of the search (the INTENT), there is a semantic similarity between the INTENT of search and its results. In other words – you get what you’re looking for. When search results simply match the search terms but not the INTENT of the search, there only a lexical similarity between the search and its results. In other words – the words match, but you don’t get what you’re looking for.</p>
<p>For example, if I were manually searching for Linux systems administrators &#8211; that is the INTENT of my search &#8211; to find people who have been primarily responsible for administering Linux systems, regardless of exactly how they express that experience in their resume. However, with basic keyword and title search, I will find a mix of relevant results (people who have been responsible for Linux administration) and irrelevant lexical-only match results (people who happen to mention the search terms of Linux and systems and administraion somwehere in their resumes or profiles, but who have not been primarily responsible for Linux systems administration).</p>
<p>However, as a thinking human being, I can truly learn from the search results and continually improve and adapt my search strategies and keywords to leverage the language used specifically by people who have been primarily responsible for administering Linux systems (using semantics &#8211; the actual meaning of specific words and phrases), ensuring more of the results returned actually match the INTENT of my search.</p>
<h3>So How Do Applications Achieve Semantic Search?</h3>
<p>To provide relevant search results, various automated search and match applications claim to:</p>
<ul>
<li>Have “pseudo-AI” – these perform semantic &#8220;concept matching&#8221; based on a taxonomy/ontology (lists of terms, equivalent terms and their relationships)</li>
<li>Perform “natural language” search &#8211; using pre-programmed phrases and/or proximity matching</li>
<li>Execute “context-aware” semantic matching of jobs and resumes</li>
<li>Perform fuzzy matching – returning results matching words that were not specifically searched for by the user, but that the application &#8220;thinks&#8221; is likely to be relevant</li>
<li>Have “full AI” – the software is designed with algorithms to create relationships between words, abbreviations and phrases dynamically and without human intervention</li>
</ul>
<h3>Applications Are Not Mind Readers</h3>
<p>No software application is capable of determining the INTENT of any search &#8211; it is critical to recognize and understand this, because it&#8217;s at the very core of semantic matching, although it&#8217;s not often written about when it comes to sourcing candidates. No application can actually *know* what you&#8217;re looking for &#8211; only YOU know what you&#8217;re trying to find, which is a person who has had some sort of specific experience performing certain responsibilities, often in specific environments.</p>
<p>Essentially, automated search/match applications take either your search terms, a job description, and/or an example resume and make their &#8220;best guess&#8221; as you your intent. Don&#8217;t get me wrong &#8211; some applications do a very good job of taking what you give them and providing some relevant matches. However, you must always be aware that these are just that &#8211; guesses &#8211; and on top of it all, they&#8217;re guesses made by an application, not a person.</p>
<h3>Output is Limited by Input</h3>
<p>Search/match aplications are only as good as their input - if a user is not especially adept at crafting search strings the quality of the results the application can produce will be limited. Additionally, we are all aware the resumes and job descriptions are FAR from the best representations of skills and experience, so the results produced by an application interpreting a resume or a job order is instrinsically limited.</p>
<p>There is no replacement for the cognitive and interpretive power of the human mind &#8211; people are much more capable of &#8220;reading between the lines&#8221; of resumes and job descriptions and getting to the true essence of the required skills and experience to determining how to best approach sourcing efforts to find the right people. Applications can claim to do this, but it&#8217;s an apples to oranges comparison.</p>
<p>Let&#8217;s also remember that a sourcer or recruiter can discuss the position&#8217;s requirements with the hiring manager (in most cases &#8211; if not, they should be able to!) &#8211; this information is typically beyond what any example resume or written job description can (or does) convey. Armed with this critical information and detail, a sourcer or recruiter can translate what they&#8217;ve gathered verbally from the hiring manager/team into a sourcing strategy, down to the Boolean search string level. The last time I checked, an application cannot do this. There is no replacement for good analytical and interpretive ability &#8211; it&#8217;s precisely why you can&#8217;t fully automate a business analyst&#8217;s, a data analyst&#8217;s or a financial analyst&#8217;s job.</p>
<p>Part of the instrinsic challenge faced by automated search and matching applications is that they are expected to make the leap from &#8220;give me what I said&#8221; (match the words) to &#8220;give me what I want&#8221; &#8211; matching your intent. Considering that an application is incapable of &#8220;understanding&#8221; your intent, how can it hope to deliver results that match your intent when your intent cannot be interpreted from an example resume, a job description, or a poorly defined search? </p>
<h3>Intrinsic Limitations of Automated Semantic Search/AI</h3>
<p>Be aware that any automated semantic search/match application is only as good as its programming, taxonomy, and ontology:</p>
<ul>
<li>Pre-programmed lists of “relevant” or related keywords may in fact not be relevant or related to your specific search, and can get outdated quickly</li>
<li>Fuzzy matching is by definition “approximate” or “inexact” matching, the very opposite of &#8220;precise&#8221;</li>
<li>Applications are essentially “guessing” the intent of your search based on the keywords, resume, or job description you feed it &#8211; but finding the best talent is not a guessing game</li>
<li>Some systems will return results with related words – which may in fact NOT be relevant to your specific need &#8211; will you be able to determine which are and which aren&#8217;t?</li>
<li>Applications that claim to &#8220;learn&#8221; may not actually improve the relevance of results over time</li>
</ul>
<h3>The Tough Questions</h3>
<p>Are you capable of evaluating an application &#8220;under the hood?&#8221;  When a vendor tells you that their application is &#8220;trained to identify the equivalent meanings of terms found in resumes,&#8221; do you know how to get to the bottom of exactly how their application accomplishes this, and whether or not their application can actually do it for your specific hiring profiles? Could you manually run searches to objectively evaluate the vendor&#8217;s claims? If you don&#8217;t already possess the ability to manually source candidates from information systems, what will you do when your automated search/match application fails to produce the right results, or enough of them? Do you have the ability to search your own database to find the candidates that the search/match application can not find/misses? </p>
<h3>My Experience and Opinion</h3>
<p>I&#8217;ve had the opportunity to use a number of automated semantic search/match applications. I&#8217;m also currently implementing one and I&#8217;m involved in customizing the taxonomies and ontologies. I&#8217;m very excited about the features and capability of the semantic search application I&#8217;m using. However - I&#8217;ve found it does have it&#8217;s limitations, and that there is no perfect solution that magically produces the best candidates available with the push of a button, nor can an automated intelligent/semantic search application fully replace a skilled sourcer or recruiter. Augment and empower - yes. Replace &#8211; no.</p>
<p>As someone who is highly proficient in manual Boolean and semantic search, from everything I have gathered, most automated search and match applications simply automate basic sourcing best practices, but none that I have used do so completely or flawlessly.</p>
<p>In fact, some search/match apps appear to do little more than heavily leverage common title and skill terminology matching &#8211; which any sourcer or recruiter of average skill can accomplish. Other applications go overboard and get sloppy in the process, suggesting and incorporating alternate search terms it *thinks* are related and relevant to your search &#8211; and in some cases the terms are actually NOT relevant to the intent of the user&#8217;s search, causing more harm than good. But if you don&#8217;t know what to look for (past the marketing and past the huge cloud of &#8220;related&#8221; search terms the app suggests) and you don&#8217;t know how to get to the core of the search logic and the true relevance based on the intent of your search, you might just actually believe you have the solution to all of your candidate sourcing problems.</p>
<p>While some industries can benefit heavily from automated search and match applications &#8211; for example,  those with simple and highly consistent titles, <a class="wp-caption-dd" title="What exactly is &quot;lexicology?&quot;" href="http://en.wikipedia.org/wiki/Lexicology" target="_blank">lexicology</a>, and <a class="wp-caption-dd" title="What exactly is &quot;phraseology&quot; and how does it differ from &quot;lexicology?&quot;" href="http://en.wikipedia.org/wiki/Phraseology" target="_blank">phraseology</a>(such as finance and accounting), others vary widely and change rapidly (such as Information Technology) and pose serious challenges for vendors of automated search and match solutions.</p>
<h3>Check Your Reasoning</h3>
<p>If you&#8217;re looking to implement an automated candidate search/match application, simply ask yourself why &#8211; and dig down to the very root of it. Is it because Boolean searching is &#8220;too hard?&#8221; Is it because you think the application can reduce your costs by reducing headcount? Do you think a search/match app can speed up the talent identification cycle?</p>
<p>I&#8217;ll let you in on a little secret &#8211; Boolean search isn&#8217;t that hard with the proper training. It&#8217;s more science than art, really. Which is good news, because manual talent mining via Boolean search strings can be broken down to repeatable steps, including the interpretive and analytical processes, which can be continually and dynamically improved &#8211; whereas an application cannot (easily, or at all). </p>
<p>Also &#8211; semantic search/match applications are not a replacement for people, because the applications don&#8217;t actually perform real work or deliver value to the end customer. When I refer to &#8220;work&#8221; and &#8220;value,&#8221; I&#8217;m referring to how Lean and the Toyota Production System define those terms. Recall Jeffrey Liker&#8217;s assessment, &#8220;People do the work, computers move the information.&#8221; The author of <a class="wp-caption-dd" title="Excellent read - I found it quite easy to apply many of the principles to recruiting" href="http://www.amazon.com/Toyota-Way-Jeffrey-Liker/dp/0071392319" target="_blank">The Toyota Way</a> has also explained that &#8221;&#8230;the only thing that adds value in any type of process&#8230;is the physical or information transformation of that product, service, or activity into something the customer wants.&#8221;</p>
<p>By customer, Jeffrey means the END customer &#8211; the internal hiring manager or the external client. Automated search/match apps don&#8217;t transform or produce a product that the end client wants &#8211; which is a talented person who is a great match for their need. Automated search/match applications produce resumes and profiles (i.e. &#8220;computers move the information&#8221;) and the sourcers and/or recruiters analyze and transform those resumes/profiles into fully screened, closed, and qualified candidates for the end customer (i.e., people do the work). </p>
<h3>My Suggestions</h3>
<p>Automated search/match applications definitely have their place in world of sourcing and recruiting. I think they are best used to facilitate and augment the talent identification efforts of sourcers and recruiters. Some can be especially useful in quickly and simultaneously searching multiple online sources of resumes and social media profiles and parsing large volumes of results into your internal ATS/CRM &#8211; this quick and permanent human capital data capture can be a major benefit of using some search/match solutions.</p>
<p>Ultimately, I feel that sourcers and recruiters using AI/semantic search applications should utilize the results they produce and return as &#8221;suggested reading&#8221; - but I would never rely solely on an application to exhaustively identify top talent, just as no one would trust a plane full of people to take off and land on autopilot. </p>
<p>Whether or not you are considering purchasing an automated search and match application or service, or you&#8217;ve already implemented one, I strongly urge you to #1 First develop your skills and ability (or your team&#8217;s) to manually source candidates, #2 Document your sourcing best practices and processes, #3 Make sure that they are consistently trained and applied, and #4 Strive to continually improve them. THEN asses the best way to leverage an automated matching solution to augment your talent identification efforts.</p>
<p>Do not take a poorly functioning sourcing process or team and expect to fix it using an automated search/match application. Fine tune your sourcing process and best practices, develop your sourcers and recruiters with exceptional training, and then surgically insert matching technology to enhace them. Technology is a tool that exists to support your people and your processes &#8211; it is not a solution to a problem nor a replacement for a process or a person. </p>
<h3>Conclusion</h3>
<p>I am in no way against inserting semantic search applications into the sourcing function (remember, I&#8217;m using one now!) &#8211; but I feel it must be done for the right reasons AND with a full understanding and capability of performing the sourcing processes manually, else you cannot continually improve your sourcing processes, nor will you be capable of picking up sourcing where the application fails to deliver.  </p>
<p>If finding some candidates is your end goal, then you can feel comfortable using automated search and match solutions to do all of the &#8220;heavy lifting&#8221; for you or your associates when it comes to talent identification. However, if you goal is to find all of the best candidates, then you should use an automated matching application as a tool to support your sourcers and recruiters who are well trained and effective in manual sourcing best practices.</p>
<h3>Caveat Emptor</h3>
<p>If you are looking to purchase an automated candidate search/match solution, or you&#8217;ve already implemented one, and you do not have the expertise or experience associated with assessing and implementing such solutions &#8211; I STRONGLY urge you to seek a neutral, third party HR/Recruiting technology consultant and involve them in the process. It is all too easy to be sold by a vendor&#8217;s marketing and messaging &#8211; but you are at a distinct disadvantage if the vendor knows more than you about the sourcing and matching function &#8211; be it manual OR automated.</p>
<p>Be wary – do not seek to automate a process that which you do not fully understand how to perform manually.</p>
<h3>Partial List of Vendors of Matching Technology:</h3>
<ul>
<li>Pure Discovery: <a href="http://www.purediscovery.com/">http://www.purediscovery.com/</a></li>
<li>Actonomy: <a href="http://www.actonomy.com/">http://www.actonomy.com/</a></li>
<li>Semetric (Engenium): <a href="http://www.krollontrack.com/semetric/">http://www.krollontrack.com/semetric/</a></li>
<li>TalentSpring: <a href="http://www.talentspring.com/">http://www.talentspring.com/</a></li>
<li>Sovren: <a href="http://www.sovren.com/">http://www.sovren.com/</a></li>
<li>BurningGlass: <a href="http://www.burning-glass.com/">http://www.burning-glass.com/</a></li>
<li>ResumeMirror: <a href="http://www.talenttech.com/">http://www.talenttech.com/</a></li>
</ul>
<p>Are you a vendor and want to be added to this list? Would you like me to evaluate your product? Let me know.  Thanks!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.booleanblackbelt.com/2009/07/candidate-search-automation-proceed-with-caution/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
		<item>
		<title>Sourcing and Recruiting Resources Page</title>
		<link>http://www.booleanblackbelt.com/2009/06/sourcing-and-recruiting-resources-page/</link>
		<comments>http://www.booleanblackbelt.com/2009/06/sourcing-and-recruiting-resources-page/#comments</comments>
		<pubDate>Fri, 12 Jun 2009 00:40:41 +0000</pubDate>
		<dc:creator>Boolean Black Belt</dc:creator>
				<category><![CDATA[Boolean]]></category>
		<category><![CDATA[How-To's]]></category>
		<category><![CDATA[Resume Sourcing]]></category>
		<category><![CDATA[Semantic Search]]></category>
		<category><![CDATA[Sourcing Mistakes]]></category>
		<category><![CDATA[Sourcing and Recruiting]]></category>
		<category><![CDATA[Automated Semantic Search]]></category>
		<category><![CDATA[Boolean Black Belt]]></category>
		<category><![CDATA[Boolean Logic]]></category>
		<category><![CDATA[Boolean Operators]]></category>
		<category><![CDATA[E-Sourcing]]></category>
		<category><![CDATA[Extended Boolean]]></category>
		<category><![CDATA[Facebook Recruiting]]></category>
		<category><![CDATA[Facebook Search]]></category>
		<category><![CDATA[Finding Resumes on Google]]></category>
		<category><![CDATA[Google Recruiting]]></category>
		<category><![CDATA[Google Resumes]]></category>
		<category><![CDATA[How To]]></category>
		<category><![CDATA[How to search LinkedIn]]></category>
		<category><![CDATA[Internet Resumes]]></category>
		<category><![CDATA[LinkedIn X-Ray Search]]></category>
		<category><![CDATA[Manual Semantic Search]]></category>
		<category><![CDATA[Private Profiles]]></category>
		<category><![CDATA[Proximity]]></category>
		<category><![CDATA[Recruiting]]></category>
		<category><![CDATA[Resume Matching]]></category>
		<category><![CDATA[Searching Facebook]]></category>
		<category><![CDATA[Searching Twitter]]></category>
		<category><![CDATA[Sourcing]]></category>
		<category><![CDATA[Sourcing Facebook]]></category>
		<category><![CDATA[Sourcing Twitter]]></category>
		<category><![CDATA[Twitter Recruiting]]></category>
		<category><![CDATA[Twitter Search]]></category>
		<category><![CDATA[Weighting]]></category>
		<category><![CDATA[x-ray]]></category>
		<category><![CDATA[x-ray search]]></category>
		<category><![CDATA[x-ray searching]]></category>

		<guid isPermaLink="false">http://www.booleanblackbelt.com/?p=2997</guid>
		<description><![CDATA[It&#8217;s been a long time coming, but I have finally gotten around to creating a resources page that essentially contains a &#8220;best of&#8221; compilation of Boolean Black Belt articles. It contains 10 &#8220;How-To&#8221; posts ranging from how to search Linkedin, Twitter, Facebook, and Google for candidates, as well as articles on semantic search, Boolean, extended Boolean, [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2009%2F06%2Fsourcing-and-recruiting-resources-page%2F"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2009%2F06%2Fsourcing-and-recruiting-resources-page%2F" height="61" width="51" /></a></div><p>It&#8217;s been a long time coming, but I have finally gotten around to creating a resources page that essentially contains a &#8220;best of&#8221; compilation of Boolean Black Belt articles. It contains 10 &#8220;How-To&#8221; posts ranging from how to search Linkedin, Twitter, Facebook, and Google for candidates, as well as articles on semantic search, Boolean, extended Boolean, and the top 15 common e-sourcing mistakes.</p>
<h3><a class="wp-caption-dd" title="Sourcing and Recruiting Resources Page" href="http://www.booleanblackbelt.com/sourcing-recruiting-resources/" target="_self">Here&#8217;s where to find it:</a></h3>
<p><a href="http://www.booleanblackbelt.com/sourcing-recruiting-resources/"><img class="alignnone size-full wp-image-3000" title="Sourcing and Recruiting Resources" src="http://www.booleanblackbelt.com/wp-content/uploads/2009/06/sourcingrecruitingresources1.png" alt="" width="380" height="306" /></a> </p>
<p>Enjoy!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.booleanblackbelt.com/2009/06/sourcing-and-recruiting-resources-page/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Maximizing Your E-Sourcing Efforts</title>
		<link>http://www.booleanblackbelt.com/2009/04/maximizing-your-e-sourcing-efforts/</link>
		<comments>http://www.booleanblackbelt.com/2009/04/maximizing-your-e-sourcing-efforts/#comments</comments>
		<pubDate>Mon, 13 Apr 2009 12:00:35 +0000</pubDate>
		<dc:creator>Boolean Black Belt</dc:creator>
				<category><![CDATA[Artificial Intelligence Matching]]></category>
		<category><![CDATA[Boolean Logic]]></category>
		<category><![CDATA[Exalead]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[How-To's]]></category>
		<category><![CDATA[Internet Sourcing]]></category>
		<category><![CDATA[Job Boards]]></category>
		<category><![CDATA[Proximity Searching]]></category>
		<category><![CDATA[Recruiting Technology]]></category>
		<category><![CDATA[Resume Aggregators]]></category>
		<category><![CDATA[Semantic Search]]></category>
		<category><![CDATA[Social Media]]></category>
		<category><![CDATA[Sourcing and Recruiting]]></category>
		<category><![CDATA[Talent Warehouse]]></category>
		<category><![CDATA[AI Matching]]></category>
		<category><![CDATA[Boolean Operators]]></category>
		<category><![CDATA[Data Depth]]></category>
		<category><![CDATA[LinkedIn]]></category>
		<category><![CDATA[Presentation]]></category>
		<category><![CDATA[Recruiting]]></category>
		<category><![CDATA[Search Aggregation]]></category>
		<category><![CDATA[Search Aggregators]]></category>
		<category><![CDATA[Search Automation]]></category>
		<category><![CDATA[Searchability]]></category>
		<category><![CDATA[Searching]]></category>
		<category><![CDATA[SlideShare]]></category>
		<category><![CDATA[Sourcing]]></category>
		<category><![CDATA[Time Management]]></category>
		<category><![CDATA[Twitter]]></category>
		<category><![CDATA[x-ray search]]></category>

		<guid isPermaLink="false">http://www.booleanblackbelt.com/?p=2317</guid>
		<description><![CDATA[I define E-sourcing as leveraging information systems for active talent identification &#8211; searching the Internet, social media, job board resume databases, and applicant tracking systems to find candidates. The proper use of technology in the sourcing and recruiting process should increase your efficiency, productivity, and effectiveness.  I&#8217;ve created the SlideShare presentation below to cover a number of [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2009%2F04%2Fmaximizing-your-e-sourcing-efforts%2F"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2009%2F04%2Fmaximizing-your-e-sourcing-efforts%2F" height="61" width="51" /></a></div><p>I define E-sourcing as leveraging information systems for active talent identification &#8211; searching the Internet, social media, job board resume databases, and applicant tracking systems to find candidates. The proper use of technology in the sourcing and recruiting process should increase your efficiency, productivity, and effectiveness.  I&#8217;ve created the SlideShare presentation below to cover a number of different ways for you to maximize your ability to find more of the right people more quickly, to accelerate and enable your recruiting efforts.</p>
<p>Click on the presentation below to review:</p>
<ul>
<li>Boolean operators and common query modifiers</li>
<li>Searching LinkedIn, Twitter, and Facebook</li>
<li>X-Ray searching Social Media</li>
<li>Search automation and aggregation</li>
<li>Semantic search: manual and artificial intelligence matching solutions</li>
<li>Search ROI &#8211; a comparison of the searchability and data depth of the Internet, Social Media, Resume Databases, and ATSs</li>
<li>Talent Warehouse concepts</li>
</ul>
<div style="width:425px;text-align:left" id="__ss_1273647"><a style="font:14px Helvetica,Arial,Sans-serif;display:block;margin:12px 0 3px 0;text-decoration:underline;" href="http://www.slideshare.net/glencathey/power-searching-getting-the-most-out-of-your-esourcing-and-recruiting-efforts-1273647?type=powerpoint" title="Getting the most out of your E-sourcing and recruiting efforts">Getting the most out of your E-sourcing and recruiting efforts</a><object style="margin:0px" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=powersearchingv3-090410160732-phpapp02&#038;rel=0&#038;stripped_title=power-searching-getting-the-most-out-of-your-esourcing-and-recruiting-efforts-1273647" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=powersearchingv3-090410160732-phpapp02&#038;rel=0&#038;stripped_title=power-searching-getting-the-most-out-of-your-esourcing-and-recruiting-efforts-1273647" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object>
<div style="font-size:11px;font-family:tahoma,arial;height:26px;padding-top:2px;">View more <a style="text-decoration:underline;" href="http://www.slideshare.net/">presentations</a> from <a style="text-decoration:underline;" href="http://www.slideshare.net/glencathey">Glen Cathey</a>.</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.booleanblackbelt.com/2009/04/maximizing-your-e-sourcing-efforts/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Semantic Search for Recruiters: Manual vs. Automated</title>
		<link>http://www.booleanblackbelt.com/2009/04/semantic-search-for-recruiters-manual-vs-automated/</link>
		<comments>http://www.booleanblackbelt.com/2009/04/semantic-search-for-recruiters-manual-vs-automated/#comments</comments>
		<pubDate>Wed, 01 Apr 2009 17:37:39 +0000</pubDate>
		<dc:creator>Boolean Black Belt</dc:creator>
				<category><![CDATA[Artificial Intelligence Matching]]></category>
		<category><![CDATA[Semantic Search]]></category>
		<category><![CDATA[AI Matching]]></category>
		<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Automated Search]]></category>
		<category><![CDATA[Automated Semantic Search]]></category>
		<category><![CDATA[Concept Matching]]></category>
		<category><![CDATA[Configurable proximity]]></category>
		<category><![CDATA[Human Cognition]]></category>
		<category><![CDATA[Limitations of Artificial Intelligence]]></category>
		<category><![CDATA[Manual Semantic Search]]></category>
		<category><![CDATA[NEAR Operator]]></category>
		<category><![CDATA[Proximity Search]]></category>
		<category><![CDATA[Semantic Search for Recruiters]]></category>
		<category><![CDATA[Semantic Search for Sourcers]]></category>
		<category><![CDATA[Semantic search for sourcing and recruiting]]></category>

		<guid isPermaLink="false">http://www.booleanblackbelt.com/?p=2131</guid>
		<description><![CDATA[Semantic search has been a hot topic in Internet search for a number of years now and it continues to generate quite the buzz. For example, Google just recently rolled out semantic search capability. However, when I talk about semantic search, I&#8217;m not referring to the semantic web or &#8220;web 3.0.&#8221;
I&#8217;m not so excited about [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2009%2F04%2Fsemantic-search-for-recruiters-manual-vs-automated%2F"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2009%2F04%2Fsemantic-search-for-recruiters-manual-vs-automated%2F" height="61" width="51" /></a></div><p><a href="http://www.booleanblackbelt.com/wp-content/uploads/2009/04/semantic-web-by-dullhunk.jpg"><img class="alignright size-full wp-image-2160" title="semantic-web-by-dullhunk" src="http://www.booleanblackbelt.com/wp-content/uploads/2009/04/semantic-web-by-dullhunk.jpg" alt="" width="191" height="240" /></a>Semantic search has been a hot topic in Internet search for a number of years now and it continues to generate quite the buzz. For example, Google just recently rolled out <a class="wp-caption-dd" title="Google's new semantic search capability" href="http://www.pcworld.com/businesscenter/article/161869/google_rolls_out_semantic_search_capabilities.html" target="_blank">semantic search capability</a>. However, when I talk about semantic search, I&#8217;m not referring to the <a class="wp-caption-dd" title="Semantic Web Explained" href="http://en.wikipedia.org/wiki/Semantic_Web" target="_blank">semantic web</a> or <a class="wp-caption-dd" title="Web 3.0/Semantic Search" href="http://www.pcworld.com/businesscenter/article/151058/welcome_to_web_30_semantic_searches.html?/tk=rel_news" target="_blank">&#8220;web 3.0.&#8221;</a></p>
<p>I&#8217;m not so excited about whether or not Google can correctly <a class="wp-caption-dd" title="What is disambiguation?" href="http://en.wikipedia.org/wiki/Word_sense_disambiguation" target="_blank">disambiguate</a> my search on the word &#8220;bark&#8221; and figure out if I mean &#8220;the sound of a dog,&#8221; &#8220;the skin of a tree,&#8221; or &#8220;a three-masted sailing ship.&#8221; I&#8217;m interested in an passionate about semantic search techniques and applications specifically geared towards enabling more effective and efficient talent identification and acquisition.</p>
<h3>So exactly what is semantic search?</h3>
<p>That&#8217;s an excellent question!  If you run a search on Google or even a semantic search engine such as <a class="wp-caption-dd" title="Hakia search" href="http://www.hakia.com/" target="_blank">Hakia</a> or <a class="wp-caption-dd" title="Powerset Search" href="http://www.powerset.com/" target="_blank">Powerset</a> for the phrase &#8220;semantic search,&#8221; you&#8217;ll find many confusing results and very little in the way of an easy-to-understand explanation of the concept. I&#8217;ll attempt to do my best to explain semantic search in a way that is pertinent to sourcing and recruiting.</p>
<p>Quite simply, semantic search can be defined as search techniques and/or applications that attempt to return results that more closely match the &#8220;meaning&#8221;  or intent of the search rather than simply returning results that match the words of the search. &#8220;Semantics&#8221; refers to the study of meaning, as inherent at the levels of words, phrases, and sentences.<span id="more-2131"></span></p>
<h3>What You Want vs. What You Say</h3>
<p>Most sourcers and recruiters search information systems with queries that simply return a collection of words &#8211; words that do not have any <a class="wp-caption-dd" title="Associative meaning explained" href="http://en.wikipedia.org/wiki/Associative_meaning" target="_blank">associative meaning</a> and that are not guaranteed to return relevant results. <a class="wp-caption-dd" title="The definition of relevance" href="http://en.wikipedia.org/wiki/Relevance_(information_retrieval)" target="_blank">Relevance</a> can be defined as the extent to which a search result matches the information need of the person executing the search. In other words, relevant results are what you want &#8211; highly relevant results are search results that match exactly what the searcher is looking for.</p>
<p>It&#8217;s safe to assume that sourcers and recruiters who are searching information systems to identify candidates are actually trying to find people that have specific skills, qualifications, abilities, and experience. However, just because certain words appear in a person’s resume or profile &#8211; it does not mean that the person has been primarily responsible for working with those words (typically skills, technologies, etc.). It&#8217;s important to be able to create Boolean search strings that return results that you actually want &#8211; not just the words you used in the search.</p>
<h3>Lexical vs. Semantic Similarity</h3>
<p>There is often a critical difference between the semantic similarity between a search and its results vs. the <a class="wp-caption-dd" title="Definition of Lexical" href="http://www.merriam-webster.com/dictionary/lexical" target="_blank">lexical</a> similarity between a search and its results. When search results simply match the search terms but not the intended meaning of the search, there only a <a class="wp-caption-dd" title="Definition of Lexical" href="http://www.merriam-webster.com/dictionary/lexical" target="_blank">lexical</a> similarity (the words match) between the search and its results. When the search results match the intended MEANING of the search, there is a semantic similarity between the search and its results.</p>
<p>Sourcers and recruiters leveraging semantic search are seeking to increase the probability that they will return search results of people who can actually do the job they are looking to hire for &#8211; a strong semantic match &#8211; not just people who happen to mention the words of the search somewhere in their resume or profile (lexical matches only, aka &#8211; false positives).</p>
<h3>Automated Semantic Search</h3>
<p>When I refer to &#8220;automated semantic search&#8221; for sourcing and recruiting, I&#8217;m referring to applications and systems that claim to perform semantic search for you. Semantic search applications and engines take your search terms and attempt to &#8220;understand&#8221; the intent of your search (i.e., what you&#8217;re really looking for) and return relevant results.</p>
<p>For example &#8211; various semantic search applications claim to:</p>
<ul>
<li> Have “pseudo-Artificial Intelligence” – performing concept matching based on a back end list of keywords (e.g., CPA is associated with accounting, J2EE is associated with Java, etc.)</li>
<li> Execute “context-aware” semantic matching of job descriptions and resumes</li>
<li> Perform <a class="wp-caption-dd" title="Fuzzy Search defined" href="http://whatis.techtarget.com/definition/0,,sid9_gci1075268,00.html" target="_blank">fuzzy search/matching</a> – returning results that are likely to be relevant</li>
<li> Have “full Artificial Intelligence” – the software is designed with algorithms to create relationships between words, abbreviations and phrases dynamically and without human intervention</li>
</ul>
<p>Partial List of Semantic Vendors (with links):</p>
<ul>
<li><a class="wp-caption-dd" title="PureDiscovery" href="http://www.purediscovery.com/" target="_blank">Pure Discovery</a></li>
<li><a class="wp-caption-dd" title="Actonomy" href="http://www.actonomy.com/" target="_blank">Actonomy</a></li>
<li><a class="wp-caption-dd" title="Semetric" href="http://www.krollontrack.com/semetric/" target="_blank">Semetric (Engenium)</a></li>
<li><a class="wp-caption-dd" title="TalentSpring" href="http://www.talentspring.com/" target="_blank">TalentSpring</a></li>
<li><a class="wp-caption-dd" title="Sovren" href="http://www.sovren.com/" target="_blank">Sovren</a></li>
<li><a class="wp-caption-dd" title="BurningGlass" href="http://www.burning-glass.com/" target="_blank">BurningGlass</a></li>
<li><a class="wp-caption-dd" title="ResumeMirror" href="http://www.talenttech.com/" target="_blank">ResumeMirror</a></li>
<li><a class="wp-caption-dd" title="Autonomy" href="http://www.autonomy.com/" target="_blank">Autonomy</a></li>
</ul>
<h3>Limitations of Automated Semantic Search</h3>
<p>While vendors of semantic search solutions are happy to sell you on the idea that their applications can do all of the work for you in quickly identifying highly qualified candidates, you need to be aware that there are several limitations of automated semantic search.</p>
<p>Semantic search applications that come with pre-programmed lists of “relevant” keywords may in fact return results that are not relevant to your search.  Also, a back-end list of keywords can get outdated quickly in some industries.</p>
<p><a class="wp-caption-dd" title="Fuzzy Search defined" href="http://whatis.techtarget.com/definition/0,,sid9_gci1075268,00.html" target="_blank">Fuzzy matching</a> by it&#8217;s very definition is “approximate” or “inexact” matching. Does that sound like a good thing? In other words, fuzzy search can return results that are *likely* to be relevant to a search argument even when the search does not exactly ask for it.  But who determines the likelihood? Not you.</p>
<p>While fuzzy search/match can help with misspellings and researching unfamiliar terms, I have found it to have limited usefulness for those who know *exactly* what they are searching for.</p>
<p>Many semantic search solutions employing &#8220;artificial intelligence&#8221; often return results that mention words RELATED to the terms you were searching for &#8211; which are words OTHER than the words you actually searched for. While that can sound like a good thing, I have found that in many cases these words that the application &#8220;thinks&#8221; are relevant are in fact NOT relevant to your specific search. This can actually produce more false positive results that do not match the intent of your queries, wasting your time instead of increasing your productivity.</p>
<h3>(Wo)Man vs. Machine &#8211; Human Cognition vs. Artificial Intelligence</h3>
<p>You must take a moment to reflect and realize that an application that claims to perform semantic search is essentially trying to *guess* the intent of your query &#8211; it&#8217;s taking your search terms, example resume, or job description, and guessing what might be relevant to you.  Never lose sight of the fact that the operative word in &#8220;artificial intelligence&#8221; is &#8220;artificial&#8221; &#8211; these applications have no true interpretive/<a class="wp-caption-dd" title="Definition of cognitive" href="http://www.merriam-webster.com/dictionary/cognitive" target="_blank">cognitive</a> power.</p>
<p>When you are conducting a search for candidates, only YOU can determine what results are relevant to you &#8211; or in other words &#8211; what kinds of candidates actually are capable of performing the role you are hiring for. As we all know, resumes and social media profiles are often poor and inaccurate representations of candidate&#8217;s abilities and experience. Talented and experienced sourcers and recruiters go WAY beyond buzzword matching and quickly and intuitively apply significant interpretive analysis when reviewing search results, effectively &#8220;reading between the lines&#8221; to identify talent.</p>
<p>As such, I recommend that the results produced by automated semantic search and &#8220;artificial intelligence&#8221; matching applications be used for “suggested reading” &#8211; kind of like when you are searching for something to buy on Amazon.com and Amazon.com suggests products &#8220;you might also like.&#8221; But please don&#8217;t count on semantic search applications to replace talented sourcers and recruiters. I have found that these solutions tend to work best  with title matching and searching for &#8220;cookie cutter&#8221; roles. However, it must be said that even the most junior sourcer or recruiter can run a search for titles and find candidates.</p>
<h3>A Final Word of Caution Regarding Automated Semantic Search</h3>
<p>You must always keep technology in perspective. I&#8217;d like to quote a passage from The Toyota Way Fieldbook by Jeffrey K. Liker and David Meier:</p>
<p>&#8220;It is not enough to show in the abstract that IT can automate a process or provide more or better information. It must be clear how it will add value and support a well-thought-out and time-tested process. Typically, the process is done well manually before it is automated. The technology supports human decision making-it does not replace it. And the technology should not be used as an excuse to stop thinking and lose focus on kaizen.&#8221;</p>
<p>I could not have said it better myself! Semantic search engines and applications should not be seen as solutions to replace or eliminate human thought, analysis and decision making &#8211; they should be used to support human decision making.</p>
<p>What I see as <strong>EXTREMELY DANGEROUS</strong> is sourcing and recruiting organizations who are excited to implement semantic search/artificial intelligence matching solutions expecting them to &#8220;solve the problem&#8221; of the intrinsic challenges associated with leveraging information systems and human capital data for talent identification.</p>
<p>If you don&#8217;t have anyone on your team or in your organization who is highly proficient in MANUAL semantic search, let alone &#8220;standard&#8221; Boolean search/<a class="wp-caption-dd" title="Talent Mining defined" href="http://www.booleanblackbelt.com/2008/10/talent-mining-what-is-it-anyway/" target="_blank">Talent Mining</a>, how will you be able to assess if the automated solution is actually doing what the vendor is claiming it can do? How will you be able to test and determine if the automated solution is actually finding all of the available candidates, let alone the best available?</p>
<p>YOU CAN&#8217;T!</p>
<p>It is critical that you and your team first master the processes and best practices of manually searching and analyzing human capital data before you fast-forward and attempt to automate a process you don&#8217;t understand in the first place. If my word of caution isn&#8217;t enough for you, take Toyota&#8217;s advice &#8211; they&#8217;ve proven that they know a thing or two about technology and process automation.</p>
<h3>Manual Semantic Search</h3>
<p>While the <a class="wp-caption-dd" title="Semantic Web Explained" href="http://en.wikipedia.org/wiki/Semantic_Web" target="_blank">semantic web</a> and semantic search engines employing artificial intelligence/concept matching get a lot of buzz in the HR, recruiting and staffing industry, the big secret is that you don&#8217;t have to rely on a search engine, system or an application to perform semantic search for you.</p>
<p>Yes &#8211; sourcers and recruiters can perform semantic search manually.  I&#8217;ve referred to this concept as user-generated/user-defined semantic search in previous posts. Instead of letting an application take an artificially educated guess as to the intent of your search and give you results it *thinks* are relevant to you, you can create Boolean search strings that go beyond simply trying to match the words themselves and attempt to delve into the meaning implied by the words &#8211; targeting candidates based on what they DO, not just what they SAY.</p>
<h3>Examples of User-Defined Semantic Search</h3>
<p>#1 BASIC: adding functional/responsibility-related terms to your searches</p>
<ul>
<li> admin*, config*, deploy*, manag*, implement*, audit*, reconcil*</li>
</ul>
<p>#2 MID-LEVEL: NEAR operator (Monster, Exalead…)</p>
<ul>
<li> (implement* or deploy*) NEAR SAP</li>
<li>reconcil* NEAR (accounts or statements)</li>
</ul>
<p>#3 ADVANCED: configurable proximity and variable term weighting (Exalead, Lucene, dtSearch)</p>
<ul>
<li> Exalead: (implement* or deploy*) NEAR/5 SAP</li>
</ul>
<p>If the NEAR operator and configurable proximity searching are new concepts to you, or you&#8217;d like to learn more about how to achieve manual semantic search, I recommend that you read these 5 posts (click on links):</p>
<ul>
<li><a class="wp-caption-dd" title="Semantic Search for Sourcers and Recruiters" href="http://www.booleanblackbelt.com/2008/12/semantic-search-for-sourcers-and-recruiters/" target="_blank">Semantic Search for Sourcers and Recruiters</a></li>
<li><a class="wp-caption-dd" title="Semantic Search for Sourcers and Recruiters Round 2" href="http://www.booleanblackbelt.com/2008/12/semantic-search-for-sourcers-and-recruiters-round-2/" target="_blank">Semantic Search for Sourcers and Recruiters Round 2</a></li>
<li><a class="wp-caption-dd" title="Semantic Search Using the NEAR Boolean Operator" href="http://www.booleanblackbelt.com/2009/01/semantic-search-using-the-near-boolean-operator/" target="_blank">Semantic Search Using the NEAR Boolean Operator</a></li>
<li><a class="wp-caption-dd" title="Achieving Semantic Search Without Proximity Operators" href="http://www.booleanblackbelt.com/2009/01/achieving-semantic-search-without-proximity-operators/" target="_blank">Achieving Semantic Search Without Proximity Operators</a></li>
<li><a class="wp-caption-dd" title="Extended Boolean: Proximity and Weighting" href="http://www.booleanblackbelt.com/2008/11/extended-boolean-proximity-and-weighting/" target="_blank">Extended Boolean: Proximity and Weighting</a></li>
</ul>
<h3>Benefits of Semantic Search</h3>
<p>Quite simply, the reason why semantic search can be such a big deal to sourcers and recruiters is that when executed properly, it can:</p>
<p>#1 Significantly reduce sourcing time by reducing/eliminating “false positive” results of candidates who are not likely to be qualified</p>
<p>#2 Increase a sourcer&#8217;s/recruiter&#8217;s ability to quickly find more appropriately qualified candidates</p>
<p>#3 Enable sourcers and recruiters to move beyond “buzzword bingo” and identify talent based on what they are capable of DOING, not just the words they SAY in their resume/profile</p>
<h3>Conclusion</h3>
<p>Semantic search IS a big deal for sourcers and recruiters. Every day, more information about more people becomes available in the form of human capital data that can be found in resumes, social media profiles, Tweets, blog posts, press releases, etc. The end goal of leveraging information systems for talent identification (fancy speak for searching for candidates) is to quickly find the RIGHT people &#8211; people who are capable of performing the role you are sourcing/recruiting for.</p>
<p>One effective way of doing this is through a combination of manual and automated semantic search &#8211; leveraging search techniques and applications that increase your ability to find people who are more likely to meet or exceed your hiring requirements by tapping into the power of the meaning inherent at the levels of words, phrases, and sentences.</p>
<p>Be sure to fully understand and master the concepts and best practices of manual semantic search before jumping to automate a process you don&#8217;t fully understand and cannot evaluate properly or effectively. While it can be tempting to try and skip the challenges associated with Talent Mining via Boolean search, trying to replace or eliminate human decision making in the process of talent identification and acquisition would be a HUGE mistake!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.booleanblackbelt.com/2009/04/semantic-search-for-recruiters-manual-vs-automated/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Achieving Semantic Search without Proximity Operators</title>
		<link>http://www.booleanblackbelt.com/2009/01/achieving-semantic-search-without-proximity-operators/</link>
		<comments>http://www.booleanblackbelt.com/2009/01/achieving-semantic-search-without-proximity-operators/#comments</comments>
		<pubDate>Wed, 14 Jan 2009 15:18:32 +0000</pubDate>
		<dc:creator>Boolean Black Belt</dc:creator>
				<category><![CDATA[Semantic Search]]></category>
		<category><![CDATA[Semantic search for sourcing and recruiting]]></category>
		<category><![CDATA[Semantic search without proximity]]></category>

		<guid isPermaLink="false">http://www.booleanblackbelt.com/?p=775</guid>
		<description><![CDATA[How sourcers and recruiters can achieve semantic search WITHOUT using Boolean proximity search operators such as NEAR
If you&#8217;ve read these 3 posts: Semantic Search for Sourcers and Recruiters, Semantic Search for Sourcers and Recruiters Round 2, and Semantic Search using the NEAR Operator, you already know I am a fan of leveraging semantic search for sourcing [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2009%2F01%2Fachieving-semantic-search-without-proximity-operators%2F"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2009%2F01%2Fachieving-semantic-search-without-proximity-operators%2F" height="61" width="51" /></a></div><p><strong><a href="http://www.booleanblackbelt.com/wp-content/uploads/2009/01/semantic-sign-by-holoubek.jpg"><img class="alignright size-full wp-image-1052" title="semantic-sign-by-holoubek" src="http://www.booleanblackbelt.com/wp-content/uploads/2009/01/semantic-sign-by-holoubek.jpg" alt="" width="249" height="307" /></a>How sourcers and recruiters can achieve semantic search WITHOUT using Boolean proximity search operators such as NEAR</strong></p>
<p>If you&#8217;ve read these 3 posts: <a class="wp-caption-dd" title="Semantic Search for Sourcers and Recruiters" href="http://www.booleanblackbelt.com/2008/12/semantic-search-for-sourcers-and-recruiters/" target="_blank">Semantic Search for Sourcers and Recruiters</a>, <a class="wp-caption-dd" title="Semantic Search for Sourcers and Recruiters Round 2" href="http://www.booleanblackbelt.com/2008/12/semantic-search-for-sourcers-and-recruiters-round-2/" target="_blank">Semantic Search for Sourcers and Recruiters Round 2</a>, and <a class="wp-caption-dd" title="Semantic Search using the NEAR Operator" href="http://www.booleanblackbelt.com/2009/01/semantic-search-using-the-near-boolean-operator/" target="_blank">Semantic Search using the NEAR Operator</a>, you already know I am a fan of leveraging semantic search for sourcing and recruiting. I believe that it is critical to go beyond basic buzzword matching when creating Boolean search strings and attempt to tap into semantics &#8211; searching for words in specific contexts to find people based on what they DO, not just the buzzwords they happen to mention in their blog posts, social media profiles, or resumes. </p>
<p>So far, I&#8217;ve only discussed how to achieve semantic search using Boolean proximity operators, such as NEAR.</p>
<p>True user-defined semantic search is best achieved though proximity operators such as NEAR and w/x mainly because of the ability to control sentence structure to search for specific responsibilities and experience rather than keywords alone.</p>
<p>However, although everyone has access to an Internet search engine that does support the Boolean NEAR proximity operator (<a class="wp-caption-dd" title="Exalead Internet search engine" href="http://www.exalead.com/search" target="_blank">Exalead</a>), not everyone has access to Applicant Tracking Systems such as <a class="wp-caption-dd" title="Bullhorn" href="http://www.bullhorn.com/" target="_blank">Bullhorn</a> or online job board resume databases like Monster that support proximity searching, so I am dedicating this post to explaining how sourcers and recruiters can achieve semantic search without using any proximity operators.<span id="more-775"></span></p>
<h3>Traditional Keyword Search</h3>
<p>When most sourcers and recruiters create Boolean search strings, they tend to focus solely on titles, skillsets, and technology buzzwords, like accountant, software engineer, SAP, project manager, FASB, .Net, Java, SQL, etc.</p>
<p>However, while this is a commonly accepted practice, searching for words like those returns many false positive results <em><strong>by design</strong></em>. A false positive result is a result that matches the search terms, but does not meet the information needs of the user.  In other words, a false positive is not a <a class="wp-caption-dd" title="Definition of relevance on Wikipedia" href="http://en.wikipedia.org/wiki/Relevance_(Computer_Science)" target="_blank">relevant</a> result.</p>
<p>If you&#8217;re searching the Internet, creating Boolean search strings looking only for technologies, skillsets and/or titles doesn&#8217;t guarantee anything other than results that mention those words &#8211; not necessarily PEOPLE, which if you are a sourcer or a recruiter I&#8217;m assuming you&#8217;re targeting. </p>
<p>If you&#8217;re searching LinkedIn, your internal resume database/ATS, or an online job board resume database and you get results of people who mention those titles and/or keywords you&#8217;re searching for &#8211; you&#8217;re certainly not guaranteed that those people actually have experience doing what you need them to.</p>
<p>Just because someone mentions Java, J2EE, and Weblogic in their blog, profile, or resume, it does not mean that they have the required experience with those technologies that you need. The same applies for results that mention &#8220;accountant,&#8221; &#8221;project manager,&#8221; or any other title or technical term.</p>
<p>Sometimes the results you get from &#8220;traditional&#8221; keyword searching are of people who mention the terms you&#8217;re looking for, but you can&#8217;t tell if they are remotely qualified for your needs &#8211; so you can contact them to find out if they do or not, and at least network with them if in fact they don&#8217;t.  Other times the results you get from &#8220;traditional&#8221; keyword searching are pure junk &#8211; results that mention the words you&#8217;re looking for but the person is obviously not qualified for anything you might be looking for now, or in the future.</p>
<p>While most sourcers and recruiters accept having to slog through false positives and irrelevant results as inevitable and &#8220;just they way it is,&#8221; I prefer to ask if there is anything we can do about it.</p>
<h3>Semantic Search</h3>
<p>A more effective way to leverage Boolean search strings for talent indentification is to include <a class="wp-caption-dd" title="Definition of function" href="http://www.merriam-webster.com/dictionary/function" target="_blank">functional</a>, or responsibility-related terms in addition to titles and skills. Functional terms can help you target and highlight what the people you&#8217;re targeting actually DO, giving you insight into their experience, capability and level of responsibility – which is what your hiring manager really cares about. Hiring managers don&#8217;t (or shouldn&#8217;t!!!) care about titles or buzzwords &#8211; what&#8217;s really going to come out on the interview is whether or not the candidate has been responsible for the types of things that the manager needs them to have been responsible for in order to perform the job they are considering hiring them for. Right?</p>
<p>Here are a few examples of functional/repsonsibility-related terms you could add to your Boolean search strings in an attempt to search beyond basic skill, title, and technology keywords: </p>
<p>Administer, design, develop, implement, integrate, configure, manage, reconcile, audit, schedule, etc.</p>
<p>Remember that Google auto-stems every search term you enter unless you put it in quotes or precede the search term with a +, so adding any one of those words, as appropriate, should net you all of the word variants. You could also try adding the tilde (~) to functional terms and it will look for synonyms. For example: ~Administer, ~design, ~develop, etc. </p>
<p>On Exalead and the major job boards, you can leverage the asterisk for root word/stemming. For example: Admin*, design*, develop*, etc.</p>
<p>Coupling appropriate functional/responsibility search terms with traditional title and/or technology terms to attempt to perform <strong>imprecise semantic search</strong>. I say &#8220;imprecise&#8221; because without using proximity search operators, you cannot precisely control whether or not someone mentions Java <strong><em>close</em></strong> to the words developing, developed, developer, develop, design, etc. If responsibility terms are not mentioned close to technical/skill terms, the probability isn&#8217;t as high that there is any semantic relation between the two. We won&#8217;t be able to eliminate the false positive results of people who simply mention our search terms somewhere, but don&#8217;t have enough of, any, or the right type of experience. </p>
<p>However, adding functional/responsibility-related search terms does have several benefits:</p>
<ul>
<li>Many Internet search engines automatically favor search terms that are mentioned in close proximity within results, so some search engines may return and rank highly results that do in fact happen to mention functional/responsibility related terms close to technology/skill terms and successfully yield results that have a high semantic similarity to the intent of the search</li>
<li>Simply adding more search terms such as functional/responsibility-related words to your Boolean search strings gives search engines more words to search for and determine ranking and relevance. Results will have their relevance ranked by responsibility related terms in addition to standard buzzwords such as titles and technologies.</li>
<li>With functional/responsibility-related terms added to your Boolean search strings, simply having those terms in the results allows you to scan the results quickly to not just look for buzzwords, but also for words describing what the people DO and have DONE.</li>
</ul>
<h3>Search Examples:</h3>
<p><strong>On Google:</strong><br />
(intitle:resume | inurl:resume) Java J2EE (SOA | SOAP | service) (~design | ~develop) -~job -~jobs</p>
<p><a class="wp-caption-dd" title="Search results" href="http://www.google.com/search?hl=en&amp;q=%28intitle%3Aresume+%7C+inurl%3Aresume%29+Java+J2EE+%28SOA+%7C+SOAP+%7C+service%29+%28%7Edesign+%7C+%7Edevelop%29+-%7Ejob+-%7Ejobs&amp;btnG=Google+Search&amp;aq=f&amp;oq=" target="_blank">View the results</a></p>
<p>(intitle:resume | inurl:resume) UNIX (&#8221;system&#8221; | &#8220;systems&#8221;) ~administer Linux &#8220;Red Hat&#8221; (&#8221;server&#8221; | &#8220;servers&#8221;)  ~design -~job -~jobs</p>
<p><a class="wp-caption-dd" title="Search results" href="http://www.google.com/search?hl=en&amp;q=%28intitle%3Aresume+%7C+inurl%3Aresume%29+UNIX+%28%22system%22+%7C+%22systems%22%29+%7Eadminister+Linux+%22Red+Hat%22+%28%22server%22+%7C+%22servers%22%29++%7Edesign+-%7Ejob+-%7Ejobs&amp;btnG=Search" target="_blank">View the results</a></p>
<p>(intitle:resume | inurl:resume) &#8220;accountant&#8221; reconcile (bank | financial) statement -~job -~jobs -sample</p>
<p><a class="wp-caption-dd" title="Search results" href="http://www.google.com/search?hl=en&amp;q=%28intitle%3Aresume+%7C+inurl%3Aresume%29+%22accountant%22+reconcile+%28bank+%7C+financial%29+statement+-%7Ejob+-%7Ejobs+-sample&amp;btnG=Search" target="_blank">View the results</a></p>
<p><strong>On resume databases and ATS&#8217;s support the asterisk for stemming:</strong><br />
java AND J2EE AND (SOA* OR servic*) AND (design* OR develop*)</p>
<p>UNIX AND system* AND admin* AND Linux AND &#8220;Red Hat&#8221; AND server* AND design*</p>
<p>&#8220;Accountant&#8221; AND reconcil* AND (bank* OR financial) AND statement*</p>
<h3>Conclusion</h3>
<p>While most sourcers and recruiters simply accept the fact that it&#8217;s &#8220;normal&#8221; to get a large false positive/irrelevant results from Boolean search strings, there are many things you can do to decrease false positive and increase the relevance of your results.</p>
<p>Throwing in titles, skillset and technology terms into your Boolean search strings is like playing &#8220;buzzword bingo&#8221; - results are only required to have the words you searched for in them, and just because a document mentions a specific title, Java, Oracle, SQL, UNIX, FASB, SOX, PHP, or any title/skillset/technology term that you might be looking for, <strong><em>it doesn&#8217;t MEAN anything</em></strong> other than the words are present.</p>
<p>Semantics is the study of meaning. The presence of search terms like Java or SOX in a document do not have any intrinsic meaning in and of themselves &#8211; most meaning comes from context. Now, if those words are mention in the context of other words, say &#8211; being responsible for doing specific things with them (e.g., designing portal applications in Java, or performing SOX audits), there IS meaning.</p>
<p>If you consciously decide to add functional/responsibility-related words to your searches (such as perform or design), in conjunction with titles, skillset and tecnhology terms, you increase the probability that you can, <strong><em>by design</em></strong>, return results that are more relevant to you &#8211; results of people who don&#8217;t just mention the search terms, but have been responsible for DOING the kinds of things you need them to have experience with.</p>
<p>Even if you are focused soley on name generation, and certainly if you are sourcing and/or recruiting for specific positions/hiring profiles, you ARE ultimately looking for people with specific experience/capability. Leverage semantics and use functional/responsibility related terms in your Boolean search strings in conjunction with more traditional keywords to increase the relevance of your searches and find the people that have the experience you need.</p>
<p>By the way &#8211; if you can tweak your searches and find more of the right people more quickly &#8211; you&#8217;ve just increased your <a class="wp-caption-dd" title="Definition of productivity " href="http://en.wikipedia.org/wiki/Productivity" target="_blank">productivity</a>. But that&#8217;s a whole &#8216;nother post.</p>
<p> <img src='http://www.booleanblackbelt.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://www.booleanblackbelt.com/2009/01/achieving-semantic-search-without-proximity-operators/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Semantic Search using the NEAR Boolean Operator</title>
		<link>http://www.booleanblackbelt.com/2009/01/semantic-search-using-the-near-boolean-operator/</link>
		<comments>http://www.booleanblackbelt.com/2009/01/semantic-search-using-the-near-boolean-operator/#comments</comments>
		<pubDate>Mon, 12 Jan 2009 15:00:14 +0000</pubDate>
		<dc:creator>Boolean Black Belt</dc:creator>
				<category><![CDATA[Exalead]]></category>
		<category><![CDATA[Monster]]></category>
		<category><![CDATA[NEAR Operator]]></category>
		<category><![CDATA[Semantic Search]]></category>
		<category><![CDATA[LinkedIn]]></category>
		<category><![CDATA[site: command]]></category>

		<guid isPermaLink="false">http://www.booleanblackbelt.com/?p=777</guid>
		<description><![CDATA[
This post will cover graphic examples of how to achieve semantic search using the NEAR Boolean operator on Monster and on the Internet via Exalead using Accounting and Information Technology hiring profiles.
First, if you have not done so already, be sure to read these 2 posts that throroughly explain the concepts of user-defined semantic search for sourcing and [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2009%2F01%2Fsemantic-search-using-the-near-boolean-operator%2F"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2009%2F01%2Fsemantic-search-using-the-near-boolean-operator%2F" height="61" width="51" /></a></div><p><a href="http://www.booleanblackbelt.com/wp-content/uploads/2009/01/the-end-is-near-2-by-chris-young-433.jpg"><img class="alignright size-full wp-image-1028" title="the-end-is-near-2-by-chris-young-433" src="http://www.booleanblackbelt.com/wp-content/uploads/2009/01/the-end-is-near-2-by-chris-young-433.jpg" alt="" width="251" height="180" /></a></p>
<p>This post will cover graphic examples of how to achieve semantic search using the NEAR Boolean operator on Monster and on the Internet via <a class="wp-caption-dd" title="Exalead Internet Search Engine" href="http://www.exalead.com/search" target="_blank">Exalead</a> using Accounting and Information Technology hiring profiles.</p>
<p>First, if you have not done so already, be sure to read these 2 posts that throroughly explain the concepts of user-defined semantic search for sourcing and recruiting: <a class="wp-caption-dd" title="Semantic Search for Sourcers and Recruiters" href="http://www.booleanblackbelt.com/2008/12/semantic-search-for-sourcers-and-recruiters/" target="_blank">Semantic Search for Sourcers and Recruiters</a>, and <a class="wp-caption-dd" title="Semantic Search for Sourcers and Recruiters Round 2" href="http://www.booleanblackbelt.com/2008/12/semantic-search-for-sourcers-and-recruiters-round-2/" target="_blank">Semantic Search for Sourcers and Recruiters Round 2</a>. </p>
<p>Second, if you&#8217;re not already familiar with the NEAR operator, I highly recommend you read about it in this post: <a class="wp-caption-dd" title="Extended Boolean: Proximity and Weighting" href="http://www.booleanblackbelt.com/2008/11/extended-boolean-proximity-and-weighting/" target="_blank">Extended Boolean &#8211; Proximity and Weighting</a> (look for it in the middle of the post under &#8220;Fixed Proximity&#8221;) before proceeding any further.</p>
<h3>The NEAR operator on Monster</h3>
<p>Monster is the only major job board resume database that recognizes and supports the NEAR Boolean operator. According to Monster&#8217;s <a class="wp-caption-dd" title="Boolean Logic on Monster" href="http://media.monster.com/mm/usen/help/hq/tour/Boolean_Logic.pdf" target="_blank">documentation</a>, the NEAR operator has a maximum proximity of 10 words. For example, the search string software NEAR programmer returns ONLY those resumes that have software and programmer within 10 words of each other. </p>
<p>Let&#8217;s look at a couple of resume snippets that I used from Monster when I wrote a post on <a class="wp-caption-dd" title="Boolean Search Strings for a Sales Tax Manager" href="http://www.booleanblackbelt.com/2009/01/boolean-search-strings-for-a-sales-tax-manager/" target="_blank">Boolean Search Strings for a Sales Tax Manager</a>. What we&#8217;re going to do is go beyond the individual words themselves and look for how the candidates wrote sentences describing specifically what they&#8217;ve been responsible for doing. Then we will use the NEAR operator to create Boolean search strings that go beyond simply trying to match the words themselves and attempting to delve into the meaning implied by the words by targeting sentences describing responsibilities.<span id="more-777"></span></p>
<p><a href="http://www.booleanblackbelt.com/wp-content/uploads/2009/01/tax-manager-11.png"><img class="alignnone size-full wp-image-1005" title="tax-manager-11" src="http://www.booleanblackbelt.com/wp-content/uploads/2009/01/tax-manager-11.png" alt="" width="448" height="258" /></a> </p>
<p><a href="http://www.booleanblackbelt.com/wp-content/uploads/2009/01/tax-manager-41.png"><img class="alignnone size-full wp-image-1006" title="tax-manager-41" src="http://www.booleanblackbelt.com/wp-content/uploads/2009/01/tax-manager-41.png" alt="" width="431" height="194" /></a></p>
<p>Notice in both resume snippets, we see verbs like prepared, supervised, managed in close proximity to words like tax, statements, returns, workpapers, schedules, month and quarter end, personnel, staff, etc.</p>
<p>Let&#8217;s use the NEAR operator to specifically target sentences where candidates are talking about the types of responsibilites we need them to have had experience with:</p>
<p>prepar* NEAR (tax* or statement* or return*) and (manag* or supervis*) NEAR (personnel or accountants or staff) and tax NEAR (manager or supervisor) and (state or local or federal) and (sales or use)</p>
<p>Breaking down that search &#8211; here is exactly what&#8217;s going on with the use of the NEAR operators: </p>
<p><strong>#1 prepar* NEAR (tax* or statement* or return*)</strong></p>
<p>This will require the results to have ANY word starting with the root of tax, statement, or return within 10 words of any word beginning with the root of prepar*</p>
<p><strong>Semantic analysis:</strong><br />
This aspect of the search will be highly likely to return results of resumes that have     sentences mentioning responsibilities such as being responsible for the preparation of tax returns, statements, and returns </p>
<p><strong>#2 (manag* or supervis*) NEAR (personnel or accountants or staff)</strong></p>
<p>This will require the results to have ANY mention of any word starting with the root of manag or supervis within 10 words of personnel, staff, or accountants</p>
<p><strong>Semantic analysis:</strong><br />
This aspect of the search will be highly likely to return results of resumes that have     sentences mentioning responsibilities such as managing and supervising personnel, staff, or accountants  </p>
<p><strong>#3 tax NEAR (manager or supervisor)</strong></p>
<p>This will require ANY mention of the word tax to be within 10 words of the words manager and/or supervisor</p>
<p><strong>Semantic analysis:</strong><br />
This aspect of the search will be highly likely to return results of resumes that have     titles of tax manager or tax supervisor, as well as resumes with sentences mentioning responsibilities such as managing and supervising tax-related work and/or personnel </p>
<p>Here are a few examples of the results returned by the above search that clearly demonstrate the NEAR operator hard and effectively at work:</p>
<p><a href="http://www.booleanblackbelt.com/wp-content/uploads/2009/01/tax-manager-5.png"><img class="alignnone size-full wp-image-1009" title="tax-manager-5" src="http://www.booleanblackbelt.com/wp-content/uploads/2009/01/tax-manager-5.png" alt="" width="448" height="38" /></a></p>
<p><a href="http://www.booleanblackbelt.com/wp-content/uploads/2009/01/tax-manager-6.png"><img class="alignnone size-full wp-image-1010" title="tax-manager-6" src="http://www.booleanblackbelt.com/wp-content/uploads/2009/01/tax-manager-6.png" alt="" width="448" height="66" /></a></p>
<p><a href="http://www.booleanblackbelt.com/wp-content/uploads/2009/01/tax-manager-7.png"><img class="alignnone size-full wp-image-1011" title="tax-manager-7" src="http://www.booleanblackbelt.com/wp-content/uploads/2009/01/tax-manager-7.png" alt="" width="448" height="80" /></a> </p>
<p>If you&#8217;re very observant, you&#8217;ll see that the second snippet has a title of &#8220;Section Manager &#8211; Income Taxes.&#8221; The reason I highlight this is that a traditional title search for titles such as &#8220;Tax Manager&#8221; or &#8220;Manager of Tax&#8221; <strong><em>could not return that result</em></strong> &#8211; a nice example of a candidate that most sourcers and recruiters would not be able to find based on a common title search approach, and proof that <a class="wp-caption-dd" title="Hidden Talent Pools" href="http://www.booleanblackbelt.com/2008/10/the-hidden-talent-pools-in-every-source-of-candidates/" target="_blank">hidden talent pools </a>really do exist.</p>
<p>However, using the NEAR operator gives us a handy alternative to specific title searching, and by design, allows us to find candidates with not-as-common titles such as &#8220;Section Manager &#8211; Income taxes.&#8221; Cool.</p>
<p>Now, while the first 50 results of that search had it&#8217;s fair share of false positives &#8211; such as candidates who are now controllers, CFO&#8217;s, and such, but were at some point in their careers responsible for managing tax reporting and personnel, I want to point something out:</p>
<p>Using the NEAR operator, that search pulled 10 direct hits among the first 50 results that had titles of Tax Manager, Supervisor, or Director.</p>
<p>Then, out of curiosity, I took out the NEAR operators and simply replaced them with ANDs:</p>
<p>prepar* and (tax* or statement* or return*) and (manag* or supervis*) and (personnel or accountants or staff) and tax and (manager or supervisor) and (state or local or federal) and (sales or use)</p>
<p>Only 4 out of the first 50 results happened to be direct hits of candidates who had titles of Tax Manager, Supervisor, or Director. So we were able to <strong>more than DOUBLE our highly relevant matches</strong> among the first 50 results alone by employing the NEAR operator. Sweet.</p>
<h3>Now let&#8217;s show some love to Information Technology sourcers and recruiters</h3>
<p>Let&#8217;s say you&#8217;re looking for software engineers that, among other things, have deep Java experience as well as specific experience designing portals. This one is easy.</p>
<p>Java and (design* or develop*) and portal* NEAR (design* or develop*)</p>
<p>The double mention of the phrase (design* or develop*) is not redundant.  The first mention is to find any mention of any word starting with the roots design or develop.  The second mention is in conjunction with the NEAR operator, and REQUIRES all results to also have any mention of portal or portals to be within 10 words of any word starting with the root of design or develop.</p>
<p><strong>Semantic analysis:<br />
</strong>Using the NEAR operator to ensure that any mention of the words portal or portals is within 10 words of design* or develop* increases the probability that they are mentioned in the same sentence &#8211; and if they are mentioned in the same sentence &#8211; it&#8217;s highly likely that the person is talking about being responsible for designing/developing portals. Which is exactly what we&#8217;re looking for.</p>
<p>This is specifically different from just throwing all of the words together and HOPING we get some people who have been responsible for portal design/development. <strong><em>Hope is not a strategy.</em></strong> So we&#8217;re using the NEAR operator to target people who ARE responsible for portal design and development, by the design of our Boolean search string.</p>
<p>Here are 3 resume snippets that demonstrate the NEAR operator working its magic:</p>
<p><a href="http://www.booleanblackbelt.com/wp-content/uploads/2009/01/portal-1.png"><img class="alignnone size-full wp-image-1012" title="portal-1" src="http://www.booleanblackbelt.com/wp-content/uploads/2009/01/portal-1.png" alt="" width="448" height="36" /></a></p>
<p><a href="http://www.booleanblackbelt.com/wp-content/uploads/2009/01/portal-2.png"><img class="alignnone size-full wp-image-1013" title="portal-2" src="http://www.booleanblackbelt.com/wp-content/uploads/2009/01/portal-2.png" alt="" width="448" height="108" /></a></p>
<p><a href="http://www.booleanblackbelt.com/wp-content/uploads/2009/01/portal-3.png"><img class="alignnone size-full wp-image-1014" title="portal-3" src="http://www.booleanblackbelt.com/wp-content/uploads/2009/01/portal-3.png" alt="" width="448" height="59" /></a></p>
<p>As you can see, using the NEAR operator worked quite nicely &#8211; we targeted the semantics of talking about being responsible for designing/developing portals &#8211; and we got it.</p>
<h3>The NEAR Operator on Exalead</h3>
<p>I realize that not everyone has access to Monster, so let&#8217;s take a look at an Internet search engine that everyone DOES have access to &#8211; <a class="wp-caption-dd" title="Exalead Internet search engine" href="http://www.exalead.com/search" target="_blank">Exalead</a>.</p>
<p>Exalead is the only decent-sized Internet search engine to support proximity searching in the form of the NEAR operator. I say &#8220;decent-sized&#8221; because it&#8217;s not a major search engine in my opinion &#8211; mostly because it does not appear to index near as many pages as Google, Yahoo, Live, or Ask, and this is especially and painfully evident when you do back to back searches using the site: command to x-ray into LinkedIn. More on that in another post (here&#8217;s a quickie &#8211; feel free to run just site:linkedin.com on both Exalead and Google and you will find 10X the total results on Google).</p>
<p>Let&#8217;s take a swing at the Java/portal developer search we used above on Monster and point it towards the Internet via Exalead.</p>
<p>(intitle:resume OR inurl:resume) AND java AND (design* OR develop*) NEAR portal* AND NOT job*</p>
<p>Before we look at the results, be aware that unlike Monster, Exalead&#8217;s NEAR distance is 16 words &#8211; which is getting a little &#8220;out there&#8221; in terms of proximty.  Also &#8211; see how I was able to actually use the asterisk for root word/stemming?  Man that feels good to be able to do that on the Internet. Anyone from Google reading this?</p>
<p><a class="wp-caption-dd" title="Search results on Exalead for design or develop NEAR portal" href="http://www.exalead.com/search/results?q=%28intitle%3Aresume+OR+inurl%3Aresume%29+AND+java+AND+%28design*+OR+develop*%29+NEAR+portal*+AND+NOT+job*&amp;x=45&amp;y=11" target="_blank">Click here for the results</a>.</p>
<p>Pretty nice results, right? If you check the results, all of them mention portal or portals within 16 words of design*/develop*, in most cases resulting in sentences where the candidate specifically talks about being responsible for/performing portal design/development. Which is exactly what we&#8217;re looking for &#8211; NOT just people who happen to mention those words somewhere in their resume. We&#8217;ve leveraged semantics in our search approach rather than reosrting to the &#8221;buzzword bingo&#8221; game.</p>
<p>But wait &#8211; there&#8217;s more! Exalead also supports configurable proximity. So if 16 words is too far of a gap for you and leads to less relevant results, we can tighen that range.</p>
<p>For example, let&#8217;s limit the distance between any mention of design*/develop* and portal* to a maximum of 5 words.</p>
<p>(intitle:resume OR inurl:resume) AND java AND (design* OR develop*) NEAR/5 portal* AND NOT job*</p>
<p><a class="wp-caption-dd" title="Search results on Exalead using NEAR/5" href="http://www.exalead.com/search/results?q=%28intitle%3Aresume+OR+inurl%3Aresume%29+AND+java+AND+%28design*+OR+develop*%29+NEAR%2F5+portal*+AND+NOT+job*&amp;x=32&amp;y=13" target="_blank">Click here for the results</a>.</p>
<p>We&#8217;ve managed to cut the total number of results down significantly, and we&#8217;ve also increased the relevance while reducing false positives. Look at how tight those results are! It&#8217;s because every single result HAS to mention design*/develop* within 5 words or less of portal*.  We are leveraging semantics heavily here because most mentions of those words in such close proximity are in fact referencing responsibility &#8211; we&#8217;ve blown past word matching to nail people talking about doing what we need them to have experience with! Am I the only person excited about this?</p>
<p>However, as exciting as this is and as tight as those results are, we have to be cognizant of the fact that there ARE relevant results we just eliminated.  Yup &#8211; anyone who mentioned design*/develop* at a range of 6-16 words from portal* was wiped away and we did not see them.  Which is okay, as long as you are aware of this and know how to go back and get them.</p>
<p>As I&#8217;ve discussed in many posts, I think the best approach to <a class="wp-caption-dd" title="Secondary Sourcing defined" href="http://en.wikipedia.org/wiki/Sourcing_(personnel)" target="_blank">secondary sourcing </a>is to start tight and highly focused on the most relevant results rather than starting broad and beginning a search buy sifting through tons of false postitives. In other words, if I were target shooting &#8211; I would start with a sniper rifle and try to hit the bullseye, rather than start with a shotgun and just be happy to hit the target. </p>
<h3>Applying NEAR to LinkedIn</h3>
<p>Let&#8217;s use the site: command to use Exalead to x-ray into LinkedIn and exploit Exalead&#8217;s configurable proximity search to find Java developers who have been responsible for designing/developing portals.</p>
<p>site:linkedin.com AND java AND (design* OR develop*) NEAR/8 portal* AND (inurl:pub OR inurl:in) -intitle:directory</p>
<p><a class="wp-caption-dd" title="Exalead search of LinkedIn using the site: command and NEAR/8" href="http://www.exalead.com/search/results?q=site%3Alinkedin.com+AND+java+AND+%28design*+OR+develop*%29+NEAR%2F8+portal*+AND+%28inurl%3Apub+OR+inurl%3Ain%29+-intitle%3Adirectory&amp;x=37&amp;y=8" target="_blank">Click here for the results</a>.</p>
<p>Yup, NEAR works there as well. Every result mentions some mention of design*/develop* within 8 words of portal or portals.  </p>
<p>Also &#8211; did you notice how I did not use the -/minus sign coupled with 4 to 6 things I was trying to avoid like jobs, answers, and such? I have found that using (inurl:pub OR inurl:in) is a little cleaner - it simply targets public profiles and all I need to NOT out is intitle:directory in most cases.</p>
<h3>Conclusion</h3>
<p>Being able to control how close words are mentioned to each other via the NEAR operator enables us to achieve semantic search &#8211; tapping into sentence structure and the power of meaning in language. Instead of throwing a bunch of words together and having to sift through large volumes of irrelevant and false positive results, we can attempt to harness semantics to find people based on what they have experience DOING, not just based on what words they happen to include somewhere in their resume.</p>
<p>Kudos to Monster for being the only major online job board to support the NEAR operator, and props to Exalead for not only supporting NEAR, but going a step further and supporting configurable proximity via NEAR/x.</p>
<p>Can&#8217;t get enough of semantic search for sourcing and recruiting? My next post will cover how to achieve semantic search for sourcing and recruiting without using any proximity operators. </p>
<p>Stay tuned!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.booleanblackbelt.com/2009/01/semantic-search-using-the-near-boolean-operator/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Semantic Search for Sourcers and Recruiters, Round 2</title>
		<link>http://www.booleanblackbelt.com/2008/12/semantic-search-for-sourcers-and-recruiters-round-2/</link>
		<comments>http://www.booleanblackbelt.com/2008/12/semantic-search-for-sourcers-and-recruiters-round-2/#comments</comments>
		<pubDate>Wed, 31 Dec 2008 16:58:14 +0000</pubDate>
		<dc:creator>Boolean Black Belt</dc:creator>
				<category><![CDATA[Semantic Search]]></category>
		<category><![CDATA[Semantic]]></category>

		<guid isPermaLink="false">http://www.booleanblackbelt.com/?p=876</guid>
		<description><![CDATA[Looking at my blog stats, it appears I may have struck a nerve when I wrote a post on semantic search for sourcers and recruiters on Monday, December 29th, explaining the concepts of semantic search with regard to how it can be leveraged effectively by sourcers and recruiters.
I received several questions, comments, and inquiries for more information on the subject. If [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2008%2F12%2Fsemantic-search-for-sourcers-and-recruiters-round-2%2F"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2008%2F12%2Fsemantic-search-for-sourcers-and-recruiters-round-2%2F" height="61" width="51" /></a></div><p style="text-align: left;"><a href="http://www.booleanblackbelt.com/wp-content/uploads/2008/12/semantic-antics-by-inju1.jpg"></a><a href="http://www.booleanblackbelt.com/wp-content/uploads/2008/12/semantic-antics-by-inju2.jpg"><img class="alignright size-medium wp-image-898" title="semantic-antics-by-inju2" src="http://www.booleanblackbelt.com/wp-content/uploads/2008/12/semantic-antics-by-inju2.jpg" alt="" width="272" height="267" /></a>Looking at my blog stats, it appears I may have struck a nerve when I wrote a post on <a class="wp-caption-dd" title="Semantic Search for Sourcers and Recruiters" href="http://www.booleanblackbelt.com/2008/12/semantic-search-for-sourcers-and-recruiters/" target="_blank">semantic search for sourcers and recruiters </a>on Monday, December 29th, explaining the concepts of semantic search with regard to how it can be leveraged effectively by sourcers and recruiters.</p>
<p style="text-align: left;">I received several questions, comments, and inquiries for more information on the subject. If you perform some research on the concept, you likely won&#8217;t find much, if anything specifically focused on user-defined semantic search, let alone user-defined semantic search for sourcing and recruiting, so I figured I would take this opportunity to quickly follow up and expand upon the concepts of leveraging semantic search when creating Boolean search strings. </p>
<p><a class="wp-caption-dd" title="Irina Shamaeva" href="http://www.linkedin.com/in/irinashamaeva" target="_blank">Irina Shamaeva</a>, of Boolean Strings fame (on both <a class="wp-caption-dd" title="Boolean Strings Group on LinkedIn" href="http://www.linkedin.com/groups?gid=1176637" target="_blank">LinkedIn</a> and <a class="wp-caption-dd" title="Boolean Strings Group on RecruitingBlogs" href="http://www.recruitingblogs.com/group/booleanstrings" target="_blank">Recruitingblogs</a>), asked a very insightful question about whether or not I would agree that in fact all sourcers and recruiters are trying to perform semantic search when they are looking for candidates who match a job description</p>
<p>While I would agree that most sourcers and recruiters are trying to find people who have had specific experience performing the role and responsibilities required (and desired) of a job description – I have to say that most sourcers and recruiters are NOT actually performing semantic search. In other words, they are not specifically leveraging semantics in their search tactics and strategies. Throwing a collection of search terms into a Boolean search string based on a job description is just that – a collection of words, which do not necessarily imply any meaning in and of themselves.<span id="more-876"></span></p>
<p>Adding additional keywords that may not be explicitly mentioned in a job description, or using the NOT/- operator to eliminate false positive results may (or may not) help narrow search results, and while these are certainly search best practices, they are not instrinsically semantic search. Adding or selectively removing keywords/search terms in many cases simply produces results with the search terms (and without the removed terms) but not implying any responsibility with the search terms.</p>
<p>Using a User Interface Engineering position as an example, there are many people who can mention the words (UI OR user interface OR GUI) in their resume who in fact do not have any significant experience with interface design, even though the words are somewhere in the resume. Even if we added additional UI-related terms such as (wireframe OR human factors OR cognitive OR heuristic) we can still return many resumes of candidates that mention the search terms but who have not been <em><strong>primarily responsible for</strong></em> interface design.</p>
<p>When creating Boolean search strings to find potential candidates &#8211; the whole point is to find people who have specific skills and experience performing the role/responsibilities of the position the sourcer/recruiter is looking to fill &#8211; in other words, <strong><em>relevant results</em></strong>.  Certainly no one sets out to run a Boolean search to specifically find a bunch of people who aren&#8217;t qualified. Any results returned from a search of candidates who do NOT have the skills and experience necessary to perform the responsibilities of the position the sourcer/recruiter is looking to fill are essentially <strong><em>irrelevant results</em></strong>.  Sure, you can build relationships with them and ask for referrals, and perhaps they fit another position, but there is no denying that they simply do not match the intent of the search. </p>
<p>Sourcers and recruiters encounter this all the time when creating Boolean search strings – the search terms are present in the resume, sometimes in large quantity, but the person has not actually DONE what they need them to have done in their career. That is an excellent example of a high <strong><em>lexical similarity</em></strong> between a search and the results (the words match) and low <strong><em>semantic similarity</em></strong> of the search and the results (the person’s experience does NOT match). </p>
<p>High lexical similarity and low semantic similarity between a search and its results can also be evidenced in the “technical skills summary” of most resumes, where a laundry list of skills and technologies are present. But just because something is mentioned in a resume, it does not imply any level of expertise, recency of experience, or even paid experience. Hence someone could mention Java, Eclipse, and Swing in their resume, but not have any paid experience developing applications with them (as in educational experience or at home). This is a normal experience for sourcers and recruiters and so they assume this is simply “the way it is.”</p>
<p>However, if we use the NEAR command (or even better &#8211; the more powerful configurable proximity search operators of Lucene and dtSearch), we could add this to a search string on Monster: (develop* OR design*) NEAR (Java OR Eclipse or Swing), and the results MUST mention Java or Eclipse or Swing within 10 words of develop or design, increasing the likelihood that the results will include resumes that have sentences specifically stating development or design-level responsibility with Java/Eclipse/Swing. This is tapping into semantics – the mere presence of words in a resume does not necessarily imply any meaning, but words in the same sentence do imply meaning in most (but certainly not all) cases.</p>
<p>The NEAR operator and configurable proximity functionality of search applications such as Lucene and dtSearch are the best ways to leverage semantics when searching because they allow you to target sentence structure, such as when people talk about doing X with Y (configuring routers, reconciling reports, administering a server cluster, implementing SAP, customizing interfaces, performing SOX audits, etc.).</p>
<p>We must also recognize that the candidates that sourcers and recruiters search for are decidedly NOT profesisonal resume writers. However, they don&#8217;t have to be. Even though the majority of people are not very good at writing resumes and are clueless as to how sourcers and recruiters search for their resumes and analyze them, most people do create simple sentences with verbs and nouns (e.g., performing audits, designing portals, troubleshooting a server, managing software development, etc.) &#8211; and many people are very direct about their responsibilities/what they do, and we can take advantage of this and target these statements with proximity/semantic searching.</p>
<p>Semantic searching can also be used to specifically defeat the false positives associated with large skill summary/technology lists in resumes.  Most of these summary sections are just that &#8211; lists of technologies - and they are not sentences with subjects and verbs. Creating Boolean search strings employing a proximity search operator such as NEAR can essentially eliminate &#8220;hits&#8221; of search terms buried in lists because they are not sentences with nouns and verbs (e.g., developing applications in Java, supervising accountants, maintaining cancer registries, etc.) and they will not return results of search terms mentioned by themselves.</p>
<p>If you are interested in the concept of semantic search for sourcing and recruiting, stay tuned, as I have 2 more posts focused on semantic search coming in January.</p>
<p>Happy New Year!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.booleanblackbelt.com/2008/12/semantic-search-for-sourcers-and-recruiters-round-2/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Semantic Search for Sourcers and Recruiters</title>
		<link>http://www.booleanblackbelt.com/2008/12/semantic-search-for-sourcers-and-recruiters/</link>
		<comments>http://www.booleanblackbelt.com/2008/12/semantic-search-for-sourcers-and-recruiters/#comments</comments>
		<pubDate>Mon, 29 Dec 2008 17:12:08 +0000</pubDate>
		<dc:creator>Boolean Black Belt</dc:creator>
				<category><![CDATA[Extended Boolean]]></category>
		<category><![CDATA[Semantic Search]]></category>
		<category><![CDATA[Recruiting]]></category>
		<category><![CDATA[Sourcing]]></category>

		<guid isPermaLink="false">http://www.booleanblackbelt.com/?p=773</guid>
		<description><![CDATA[On Tuesday, December 23, 2008 I wrote a &#8220;Tutorial Tuesday&#8221; post on www.recruitingblogs.com titled, &#8220;What is Semantic Search?,&#8221; explaining the concepts of semantic search with regard to how it can be leveraged effectively by sourcers and recruiters. Now, I am not sure if the fact that it was posted the day before Christmas eve had anything to do [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2008%2F12%2Fsemantic-search-for-sourcers-and-recruiters%2F"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2008%2F12%2Fsemantic-search-for-sourcers-and-recruiters%2F" height="61" width="51" /></a></div><p><a href="http://www.booleanblackbelt.com/wp-content/uploads/2008/12/search-google-paco-calvino1.jpg"><img class="alignright size-medium wp-image-838" title="search-google-paco-calvino1" src="http://www.booleanblackbelt.com/wp-content/uploads/2008/12/search-google-paco-calvino1-300x300.jpg" alt="" width="300" height="300" /></a>On Tuesday, December 23, 2008 I wrote a &#8220;Tutorial Tuesday&#8221; post on <a href="http://www.recruitingblogs.com">www.recruitingblogs.com</a> titled, &#8220;<a class="wp-caption-dd" title="RecruitingBlogs post on &quot;What is Semantic Search?&quot;" href="http://www.recruitingblogs.com/forum/topics/tutorial-tuesday-what-is" target="_blank">What is Semantic Search?,&#8221; </a>explaining the concepts of semantic search with regard to how it can be leveraged effectively by sourcers and recruiters. Now, I am not sure if the fact that it was posted the day before Christmas eve had anything to do with it, but I only received 2 comments on the post so I am not exactly sure how many people actually had the chance to read it.  As such - I am posting it here in its entirety because while semantic search is not well known or understood, it can be powerfully applied to sourcing and recruiting efforts.</p>
<p>When I talk about semantic search, I don&#8217;t mean the <a href="http://en.wikipedia.org/wiki/Semantic_Web">semantic web </a>(which I still think is a LONG way off), I am referring to user-generated and defined semantic search. In other words, a sourcer or recruiter creating Boolean search strings that go beyond simply trying to match the words themselves and attempting to delve into the <em>meaning</em> implied by the words.</p>
<p>In linguistics, semantics refers to the study of meaning, as inherent at the levels of words, phrases, and sentences.</p>
<p>The vast majority of sourcers and recruiters create Boolean search strings that simply return a collection of words &#8211; words that do not have any associative meaning and that are not guaranteed to be relevant with regard to the intent of the search. Relevance can be defined as the extent to which a search result matches the information need based on the intent of the person executing the search. Highly &#8220;relevant&#8221; results = results that match exactly what the searcher is looking for.<span id="more-773"></span></p>
<p>Most sourcers and recruiters are actually trying to find people that have specific skills and experience. Just because certain words appear in a person&#8217;s resume or profile &#8211; it does not mean that the person has been primarily responsible for working with those words (typically skills, technologies, etc.). For example, a knowledgeable sourcer or recruiter knows that documents with the word “account&#8221; mentioned close to the word “executive” will often have a different meaning and relevance than documents that simply mention the words “account&#8221; and &#8220;executive&#8221; located anywhere within them.</p>
<p>This is the critical difference between the semantic similarity between a search and its results vs. the lexical similarity between a search and its results. In other words &#8211; when the search results match the intended MEANING of the search, there is a semantic similarity between the search and its results. When search results simply match the search terms but not the intended meaning of the search, there is a lexical similarity (the words match) between the search and its results.</p>
<p>Semantic search can best be achieved through the use of search interfaces and engines that support proximity searching. Proximity search functionality allows a sourcer or recruiter to control how close specific words are mentioned in relation to other words.</p>
<p>When you are able to control the proximity of words to each other, you can take advantage of linguistics and sentence structure to look for verbs mentioned in close proximity to nouns, which can imply taking action. If a resume mentions (configure OR configured OR configuration) &#8211; which are verbs &#8211; in close proximity to (router OR routers) &#8211; which are nouns &#8211; and within the same sentence, it is highly likely that the writer is talking about being responsible for configuring routers.</p>
<p>A sourcer or recruiter should not be satisfied to merely scan and read resumes of people who simply mention the words &#8220;configure&#8221; and &#8220;routers&#8221; somewhere in the resume &#8211; there are many people who can mention those words somewhere in their resume who have never been specifically responsible for configuring routers. The issue is that just because these words are found in a resume &#8211; the presence of the words themselves does not MEAN anything with regard to what the candidate has specifically been responsible for.</p>
<p>With the appropriate search interface/engine, sourcers and recruiters can craft semantic searches to find people who not only mention specific words such as &#8220;configure&#8221; and &#8220;routers&#8221;, but who have actually had experience configuring routers. Being able to control the proximity of words can enable recruiters to quickly get more results that are semantically relevant to what the recruiter is actually trying to find.</p>
<p>There are 3 main types of proximity searching &#8211; I will focus on what I think are the two most powerful &#8211; fixed proximity search and configurable proximity search.</p>
<p>Fixed proximity search functionality such as the &#8220;extended Boolean&#8221; NEAR operator enables users to search for words or phrases that are mentioned close to other specific words or phrases. The range of the NEAR operator is fixed, typically at 1-10 words.</p>
<p>Did you know that Monster supports the NEAR operator? Many people aren&#8217;t aware of this &#8211; but it&#8217;s the only major job board resume database that I am aware of to do so. Kudos to Monster! It is unfortunate that there are very few people who even know about the NEAR operator, and even fewer still who know how to utilize it to achieve semantic search.</p>
<p>Among Internet search engines &#8211; Google, Yahoo, Live, and Ask do not support proximity searching of any kind &#8211; only Exalead does, to my knowledge. As for Applicant Tracking Systems, I am aware that Bullhorn has integrated <a href="http://lucene.apache.org/java/docs/">Lucene</a>, a free and open source text search engine that suppports configurable proximity, into their search interface</p>
<p>Configurable proximity search goes one step further than fixed proximity, allowing a sourcer or recruiter to precisely control the maximum distance between specific search terms and to return even more relevant results than the NEAR operator. This is because the NEAR operator’s maximum range of 10 words can allow for some non-relevant results to be returned. The farther words are mentioned apart from each other, the less likely it is that they are semantically related. In fact, when two search terms are separated by 10 words, each could be mentioned in separate bullet points or sentences on a resume and be completely unrelated.</p>
<p>However, with configurable proximity, a sourcer or recruiter can choose the maximum distance between search terms. Although search engines supporting configurable proximity vary with their exact syntax, here is an example of a search looking for someone who has been responsible for administering Exchange servers: Windows AND Exchange w/5 admin* AND server*. That search can ONLY return results of resumes or profiles that mention Exchange within 5 words of any word starting with the root of admin (administrator, administration, administer, administered, etc.), regardless of order. A maximum distance of 5 words will dramatically increase the semantic similarity between the search&#8217;s intent and the search results because mentioning those 2 search terms at such a close range makes it more likely that they are mentioned in the same bullet point or sentence and thus more likely to be semantically related. Essentially, this search will only return results of people who specifically mention something about being responsible for administering Exchange in their resume.</p>
<p>Many sourcers and recruiters employing basic search tactics and strategies may unfortunately be simply throwing a bunch of keywords in a search &#8211; and as a result, end up reviewing large volumes of irrelevant results that simply match the search terms they entered (lexical match) in order to &#8220;get lucky&#8221; to find the few results buried among them that are relevant to what they are seeking. This is a huge time drain, is inefficient, and is low yield.</p>
<p>Experts at talent mining seek to craft Boolean search strings designed to reduce irrelevant &#8220;false positive&#8221; results, eliminating those of people who simply mention the words they are searching for somewhere in their resumes or profiles, and go beyond the simple lexical match to achieve semantic search &#8211; finding people whose experience and skills match the essence of their search.</p>
<p>If you don&#8217;t already take advantage of the power of semantic search to quickly find more relevant results when creating your Boolean search strings, now is the perfect time to set it as a resolution for 2009. Make it a goal to move beyond simple buzzword matching and create Boolean searches that target people more based on what they DO, rather than just the words they use in their resume.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.booleanblackbelt.com/2008/12/semantic-search-for-sourcers-and-recruiters/feed/</wfw:commentRss>
		<slash:comments>15</slash:comments>
		</item>
		<item>
		<title>Extended Boolean: Proximity and Weighting</title>
		<link>http://www.booleanblackbelt.com/2008/11/extended-boolean-proximity-and-weighting/</link>
		<comments>http://www.booleanblackbelt.com/2008/11/extended-boolean-proximity-and-weighting/#comments</comments>
		<pubDate>Mon, 10 Nov 2008 13:55:16 +0000</pubDate>
		<dc:creator>Boolean Black Belt</dc:creator>
				<category><![CDATA[Extended Boolean]]></category>
		<category><![CDATA[Semantic Search]]></category>
		<category><![CDATA[Boolean]]></category>
		<category><![CDATA[Boolean NEAR Operator]]></category>
		<category><![CDATA[Buzzword matching]]></category>
		<category><![CDATA[NEAR Operator]]></category>
		<category><![CDATA[Proximity]]></category>
		<category><![CDATA[Recruiting]]></category>
		<category><![CDATA[Relevance]]></category>
		<category><![CDATA[Sourcing]]></category>
		<category><![CDATA[Weighting]]></category>

		<guid isPermaLink="false">http://www.booleanblackbelt.com/?p=327</guid>
		<description><![CDATA[Most sourcing, recruiting, and staffing professionals are familiar with the “standard” Boolean operators of AND, OR, and NOT. However, I have found that few are familiar with “extended” Boolean functionality, such as proximity (or adjacency) and term weighting.
Beyond Basic Boolean
Extended Boolean offers sourcers and recruiters significantly more control, power and precision when executing searches, and [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2008%2F11%2Fextended-boolean-proximity-and-weighting%2F"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2008%2F11%2Fextended-boolean-proximity-and-weighting%2F" height="61" width="51" /></a></div><p><a href="http://www.booleanblackbelt.com/wp-content/uploads/2008/11/boolean-word-scramble-by-kipbot.png"><img class="size-medium wp-image-344 alignright" title="boolean-word-scramble-by-kipbot" src="http://www.booleanblackbelt.com/wp-content/uploads/2008/11/boolean-word-scramble-by-kipbot-300x89.png" alt="" width="300" height="89" /></a>Most sourcing, recruiting, and staffing professionals are familiar with the “standard” Boolean operators of AND, OR, and NOT. However, I have found that few are familiar with “extended” Boolean functionality, such as proximity (or adjacency) and term weighting.</p>
<h3>Beyond Basic Boolean</h3>
<p>Extended Boolean offers sourcers and recruiters significantly more control, power and precision when executing searches, and in the hands of an expert – extended Boolean can enable semantic search. Semantic search uses the science of meaning in language to produce highly relevant search results rather than have a user sort through a list of loosely related keyword results.</p>
<h3>Relevance is Key</h3>
<p>Ultimately, any sourcing or recruiting professional knows that what’s most critical in running Boolean searches on the Internet, a job board, or in an internal resume database, is getting relevant results. According to Wikipedia, “<a class="wp-caption-dd" title="Definition of relevance on Wikipedia" href="http://en.wikipedia.org/wiki/Relevance_(information_retrieval)" target="_blank">relevance</a>” denotes how well a retrieved set of documents (or a single document) meets the information need of the user.</p>
<p>For sourcing and recruiting, relevant results are typically defined as resumes or profiles of (or information about) potential candidates whose experience and capabilities closely match the hiring profile or job opening that the sourcer or recruiter is trying to find candidates for.</p>
<p>I’d argue that the value of any source of information (resume database, the Internet, etc.) lies less in the information contained within, and more in the ability of a user to extract out precisely and completely what the user needs – finding and retrieving any and all appropriately qualified candidates. Information has no value to you if you are unable to find it and take action on it.</p>
<p>So how can extended Boolean help sourcers and recruiters find more relevant results? Let’s take a look at term weighting first. <span id="more-327"></span></p>
<h3>Variable Term Weighting</h3>
<p>Talented sourcers and recruiters know that not all terms are equally important in a query. In most queries and searches, certain search terms are more important than others. When running standard Boolean queries, all search terms are considered/weighted equally. Unfortunately, many search engines and database search interfaces simply assign relevance to results by the number of search term “hits” in each document. In most cases, the simple frequency of search terms does not correlate to relevant results. This is where the derisive description “buzzword matching” comes from, most often used to denote that there is little skill involved in running Boolean searches counting matched keywords.</p>
<p>Using an Information technology hiring profile as an example – if a sourcer was looking for candidates who have significant experience administering Windows servers and Exchange email servers they might create a simple Boolean query such as this: Windows AND Exchange AND server* and admin*. That search is highly likely to return and rank candidates who are Windows systems administrators who mention Windows many times in their resume/profile and happen to mention Exchange once or twice as highly relevant because of the number of “hits” for Windows – which is by nature a very common term in resumes. This would leave the sourcer with having to sort through a large volume of results to find the candidates who actually have been primarily responsible for administering Exchange servers as well as Windows servers.</p>
<p>Search engines that offer users the ability to assign different weights to each search term enable sourcers and recruiters to move beyond simple buzzword matching and take control of the relevance of the results. Essentially, with variable term weighting you can assign a number value to words to increase their weight when ranking retrieved documents – which does not change the TOTAL number of results, but the ORDER of the results.</p>
<p>Using the same example as above, a sourcer using a search engine that supports variable term weighting could create a Boolean search string such as this: Windows AND Exchange:30 AND server* and admin*. That Boolean query will pull the same number of results as the first search that had no term weighting – however, it will sort and rank the results heavily favoring resumes/profiles that mention Exchange more often in relation to the other search terms, increasing the likelihood that the sourcer can quickly identify candidates who have had experience being responsible for administering and supporting Exchange servers. By employing variable term weighting, the sourcer has increased the relevance of the results.</p>
<p>Now, let’s take a look at proximity functionality:</p>
<h3>Proximity</h3>
<p>Proximity search functionality enables a user to search for specific terms that are mentioned close to other specific terms. An adept sourcer or recruiter knows that documents with the word “computer” mentioned close to the word “science” will often have a different meaning and relevance than documents that simply mention the words “computer” and “science” anywhere within them.</p>
<p>There are 3 main types of proximity searching: fixed proximity, variable proximity, and adjacency. For the purposes of this post – I will focus only on fixed and variable proximity.</p>
<h3>Fixed Proximity</h3>
<p>Fixed proximity is most commonly represented by the NEAR operator. The search engines that do recognize and support the NEAR operator typically define NEAR proximity as within 1 to 10-16 words (specific search engines can differ – check their documentation).</p>
<p>Using the example of a Windows and Exchange administrator, a sourcer could craft this search using the NEAR operator: Windows and Exchange NEAR admin* and server*. That search will ONLY return results of resumes/profiles that mention Exchange within 1 to 16 words of any word starting with the root of admin (administrator, administration, administer, administered, etc.). Being able to control the fact that Exchange MUST be mentioned within close proximity to admin* will dramatically affect and improve the relevance of the search results, typically returning results of candidates who either have a title using both terms and/or candidates that talk about being responsible for Exchange administration.</p>
<div>Here are some examples taken from actual resumes that demonstrate the variety of relevant results that can be retrieved with the above search:</div>
<ul>
<li>Managed &amp; administered more than 300 Exchange Servers</li>
<li>Provisioned &amp; administer multiple Exchange 5.5/2003 servers</li>
<li>Not only are there administration duties for Exchange and Blackberry&#8230;</li>
<li>Exchange/RightFax administrator</li>
<li>Installing, Configuring, and Administering Microsoft Exchange 2000 Server</li>
<li>Administer a Microsoft Exchange 2003/2007 environment</li>
<li>8+ years of expertise as a System Administrator in Windows 2003 family, Windows 2000 family, MS Exchange 5.5, MS Exchange 2000, and Exchange 2003</li>
<li>I am proficient with the following skills; planning, installation and administration of Windows Active Directory, Windows Servers, Exchange Server</li>
<li>Windows Server Support, Active Directory,Exchange Server 2000, 2003 administration and Blackberry Server administration</li>
<li>Administer Exchange 2003 Server for corporate email</li>
</ul>
<p>As you can see, being able to control the proximity of specific search terms essentially increases the likelihood of returning results of candidates who have had administrative responsibility for Exchange servers, effectively increasing the relevance of the results.</p>
<h3>Fun fact:</h3>
<ul>
<li> Did you know that Monster and Exalead support the NEAR operator?</li>
</ul>
<h3>Configurable Proximity</h3>
<p>A search engine that supports configurable proximity affords users the ability to precisely control the distance between specific search terms. This can produce even more relevant results than the NEAR operator, because the NEAR operator’s maximum range of 10-16 can allow for some non-relevant results to be returned. The farther words are mentioned apart from each other, the less likely it is that they are semantically related. In fact, at 10-16 words, each could be mentioned in separate bullet points or sentences on a resume and be completely unrelated.</p>
<p>However, with configurable proximity, a sourcer or recruiter can choose the maximum distance between search terms. Although search engines vary with their exact syntax, here is an example of the Windows and Exchange admin search using configurable proximity: Windows and Exchange w/5 admin* and server*. That search can ONLY return results of resumes or profiles that mention Exchange within 5 words of any word starting with the root of admin (administrator, administration, administer, administered, etc.), regardless of order. A maximum distance of 5 words will dramatically increase the relevance of the search results because mentioning those 2 search terms at such a close range makes it more likely that they are mentioned in the same bullet point or sentence and thus more likely to be semantically related. Essentially, this search will only return results of people who specifically mention something about being responsible for administering Exchange at least once in their resume. By employing this kind of search, a sourcer is actually performing a semantic search, as they are looking specifically for people who talk about having a particular responsibility – not just looking for documents that contain words.</p>
<h3>Fun facts:</h3>
<ul>
<li>Did you know that <a class="wp-caption-dd" title="Exalead search" href="http://www.exalead.com/search" target="_blank">Exalead</a> supports configurable proximity searching?</li>
<li>Did you know that you can integrate a free, open source search engine that supports configurable proximity and variable term weighting into your ATS or resume database? Check out <a class="wp-caption" title="Lucene open source search engine" href="http://lucene.apache.org/java/docs/" target="_blank">Lucene</a>.</li>
</ul>
<h3>Conclusion</h3>
<p>Hopefully you can see how being able to control the proximity of two search terms can yield results that are FAR more relevant than results that simply mention the two terms anywhere in a document or form – this is the critical difference between the semantic similarity between a search and its results vs. the lexical similarity between a search and its results.</p>
<p>There are countless ways you can apply extended Boolean functionality such as variable term weighting and proximity searching to nearly any industry/hiring profile to create searches that return highly relevant results - results that are more relevant than those that can be acheived with standard Boolean logic. Using a search engine that supports both variable proximity and variable term weighting can empower sourcers and recruiters to quickly find large volumes of highly relevant results, increasing productivity and achieving JIT Talent identification and acquisition.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.booleanblackbelt.com/2008/11/extended-boolean-proximity-and-weighting/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
	</channel>
</rss>
