<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Boolean Black Belt-Sourcing/Recruiting &#187; Artificial Intelligence Matching</title>
	<atom:link href="http://www.booleanblackbelt.com/category/artificial-intelligence-matching/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.booleanblackbelt.com</link>
	<description>Leveraging LinkedIn, Twitter, Social Media, Resume Databases, and the Internet for Sourcing and Recruiting</description>
	<lastBuildDate>Mon, 06 Feb 2012 15:00:28 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=</generator>
		<item>
		<title>Talent Sourcing: Man vs. AI/Black Box Semantic Search</title>
		<link>http://www.booleanblackbelt.com/2012/01/talent-sourcing-man-vs-aiblack-box-semantic-search/</link>
		<comments>http://www.booleanblackbelt.com/2012/01/talent-sourcing-man-vs-aiblack-box-semantic-search/#comments</comments>
		<pubDate>Mon, 09 Jan 2012 14:00:58 +0000</pubDate>
		<dc:creator>Glen Cathey</dc:creator>
				<category><![CDATA[Artificial Intelligence Matching]]></category>
		<category><![CDATA[Boolean Logic]]></category>
		<category><![CDATA[Dark Matter]]></category>
		<category><![CDATA[Extended Boolean]]></category>
		<category><![CDATA[Future of Sourcing and Recruiting]]></category>
		<category><![CDATA[HCDIR]]></category>
		<category><![CDATA[Human Capital Data]]></category>
		<category><![CDATA[Information Retrieval]]></category>
		<category><![CDATA[Recruiting Technology]]></category>
		<category><![CDATA[Resume Sourcing]]></category>
		<category><![CDATA[Search Automation]]></category>
		<category><![CDATA[Semantic Search]]></category>
		<category><![CDATA[Sourcing]]></category>
		<category><![CDATA[Sourcing Automation]]></category>
		<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Boolean]]></category>
		<category><![CDATA[Boolean Black Belt]]></category>
		<category><![CDATA[dtSearch]]></category>
		<category><![CDATA[Glen Cathey]]></category>
		<category><![CDATA[hcdir]]></category>
		<category><![CDATA[Human Capital]]></category>
		<category><![CDATA[information retrieval]]></category>
		<category><![CDATA[Lucene]]></category>
		<category><![CDATA[matching solutions]]></category>
		<category><![CDATA[NLP]]></category>
		<category><![CDATA[Recruiting]]></category>
		<category><![CDATA[Resume Matching]]></category>
		<category><![CDATA[resume parsing]]></category>
		<category><![CDATA[Semantic Clustering]]></category>
		<category><![CDATA[Sourcing solutions]]></category>
		<category><![CDATA[Talent Identification]]></category>

		<guid isPermaLink="false">http://www.booleanblackbelt.com/?p=10315</guid>
		<description><![CDATA[Back in March 2010, I had the distinct honor of delivering the keynote presentation at SourceCon on the topic of resume search and match solutions claiming to use artificial intelligence in comparison with people using their natural intelligence for talent discovery and identification. Now that nearly 2 years has passed, and given that in that [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2012%2F01%2Ftalent-sourcing-man-vs-aiblack-box-semantic-search%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2012%2F01%2Ftalent-sourcing-man-vs-aiblack-box-semantic-search%2F&amp;source=GlenCathey&amp;style=compact&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p><a href="http://www.booleanblackbelt.com/wp-content/uploads/2012/01/AI_Brain.png"><img class="alignright  wp-image-10319" title="Talent Sourcing and Matching: Artificial Intelligence and Black Box Semantic Search vs. Human Cognition and Sourcing Capability." src="http://www.booleanblackbelt.com/wp-content/uploads/2012/01/AI_Brain.png" alt="" width="219" height="239" /></a>Back in March 2010, I had the distinct honor of delivering the keynote presentation at <a title="Sourcing News and Knowledge - Beyond the Obvious." href="http://www.sourcecon.com/">SourceCon</a> on the topic of resume search and match solutions claiming to use artificial intelligence in comparison with people using their natural intelligence for talent discovery and identification.</p>
<p>Now that nearly 2 years has passed, and given that in that time I&#8217;ve had even more hands-on experience with a number of the top AI/semantic search applications available (I won&#8217;t be naming names, sorry), I decided it was time to revisit the topic which I am <em><strong>very</strong></em> passionate about.</p>
<p>If you&#8217;ve ever been curious about semantic search applications that &#8220;do the work for you&#8221; when it comes to finding potential candidates, you&#8217;re in the right place, because I&#8217;ve updated the slide deck and published it to Slideshare. Here&#8217;s what you&#8217;ll find in the 86 slide presentation:</p>
<ul>
<li>A deep dive into the deceptively simple challenge of sourcing talent via human capital data (resumes, social network profiles, etc.)</li>
<li>How resume and LinkedIn profile sourcing and matching solutions claiming to use artificial intelligence, semantic search, and <a title="Natural language processing (NLP) is a field of computer science and linguistics concerned with the interactions between computers and human (natural) languages; it began as a branch of artificial intelligence.[1] In theory, natural language processing is a very attractive method of human–computer interaction. Natural language understanding is sometimes referred to as an AI-complete problem because it seems to require extensive knowledge about the outside world and the ability to manipulate it." href="http://en.wikipedia.org/wiki/Natural_language_processing">NLP</a> actually work and achieve their claims</li>
<li>The pros, cons, and limitations of automated/<a title="A black box is a device, system or object which can be viewed solely in terms of its input, output and transfer characteristics without any knowledge of its internal workings. For resume search and match, a black box solution gives you no understanding of exactly WHY it's returned certain results or considers them relevant" href="http://en.wikipedia.org/wiki/Black_box">black box</a> matching solutions</li>
<li>An insightful (and funny!) video of <a title="Dr. Michio Kaku is a theoretical physicist, best-selling author, and popularizer of science. He’s the co-founder of string field theory (a branch of string theory), and continues Einstein’s search to unite the four fundamental forces of nature into one unified theory." href="http://mkaku.org/home/?page_id=5">Dr. Michio Kaku</a> and his thoughts on the limitations of artificial intelligence</li>
<li>Examples of what sourcers and recruiters can do that even the most advanced automated search and match algorithms can’t do</li>
<li>The concept of Human Capital Data <a title="To any sourcer or recruiter not still in the Stone Age, this should sound like a really good description of what you do when you use any sort of technology to find people or information about people: Information retrieval (IR) is the area of study concerned with searching for documents, for information within documents, and for metadata about documents, as well as that of searching structured storage, relational databases, and the World Wide Web. " href="http://en.wikipedia.org/wiki/Information_retrieval">Information Retrieval</a> and Analysis (HCDIR &amp; A)</li>
<li>Boolean and <a title="Extended Boolean typically incorporates the ability to weight each term in a Boolean search string, allowing the searcher to choose which terms are the most relevant, as well as configurable proximity - the ability to specify how close search terms are to each other, which enables powerful semantic search at the sentence level. " href="https://www.google.com/search?aq=f&amp;sourceid=chrome&amp;ie=UTF-8&amp;q=extended+Boolean">extended Boolean</a></li>
<li>Semantic search</li>
<li>Dynamic inference</li>
<li><a title="Dark Matter is a term I use to describe resumes, LinkedIn profiles, and other human capital data that exists to be found, but cannot be retrieved through direct or conventional search methods." href="http://www.booleanblackbelt.com/2011/03/linkedins-dark-matter-undiscovered-profiles/">Dark Matter</a> resumes and social network profiles</li>
<li>What I believe to be the ideal resume search and matching solution</li>
</ul>
<div>Enjoy, and let me know your thoughts.</div>
<div id="__ss_10891808" style="width: 595px;">
<p><strong style="display: block; margin: 12px 0 4px;"><a title="Talent Sourcing and Matching - Artificial Intelligence and Black Box Semantic Search vs. Human Cognition and Sourcing" href="http://www.slideshare.net/glencathey/talent-sourcing-and-matching-artificial-intelligence-and-black-box-semantic-search-vs-human-cognition-and-sourcing" target="_blank">Talent Sourcing and Matching &#8211; Artificial Intelligence and Black Box Semantic Search vs. Human Cognition and Sourcing</a></strong> <iframe src="http://www.slideshare.net/slideshow/embed_code/10891808" frameborder="0" marginwidth="0" marginheight="0" scrolling="no" width="595" height="497"></iframe></p>
<div style="padding: 5px 0 12px;">View more <a href="http://www.slideshare.net/" target="_blank">presentations</a> from <a href="http://www.slideshare.net/glencathey" target="_blank">Glen Cathey</a></div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.booleanblackbelt.com/2012/01/talent-sourcing-man-vs-aiblack-box-semantic-search/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Why So Many People Stink at Searching</title>
		<link>http://www.booleanblackbelt.com/2011/12/why-so-many-people-stink-at-searching/</link>
		<comments>http://www.booleanblackbelt.com/2011/12/why-so-many-people-stink-at-searching/#comments</comments>
		<pubDate>Mon, 19 Dec 2011 14:00:20 +0000</pubDate>
		<dc:creator>Glen Cathey</dc:creator>
				<category><![CDATA[Artificial Intelligence Matching]]></category>
		<category><![CDATA[Human Capital Data]]></category>
		<category><![CDATA[Information Retrieval]]></category>
		<category><![CDATA[Iterative Search]]></category>
		<category><![CDATA[Search Process]]></category>
		<category><![CDATA[Semantic Search]]></category>
		<category><![CDATA[Blackbox Search]]></category>
		<category><![CDATA[Critical Thinking]]></category>
		<category><![CDATA[Dark Matter Search]]></category>
		<category><![CDATA[HCIR]]></category>
		<category><![CDATA[How to get better search results]]></category>
		<category><![CDATA[information retrieval]]></category>
		<category><![CDATA[Intelligent Search]]></category>
		<category><![CDATA[NLP]]></category>
		<category><![CDATA[Search Algorithms]]></category>
		<category><![CDATA[Search Relevance]]></category>
		<category><![CDATA[Search Results]]></category>

		<guid isPermaLink="false">http://www.booleanblackbelt.com/?p=10211</guid>
		<description><![CDATA[The trouble with search today is that people put too much trust in search engines &#8211; online, resume, social, or otherwise. I can certainly understand and appreciate why people and companies would want to try and create search engines and solutions that &#8220;do the work for you,&#8221; but unfortunately the &#8220;work&#8221; being referenced here is [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2011%2F12%2Fwhy-so-many-people-stink-at-searching%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2011%2F12%2Fwhy-so-many-people-stink-at-searching%2F&amp;source=GlenCathey&amp;style=compact&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p><a href="http://www.flickr.com/photos/stickergiant/4793776078/"><img class="alignright  wp-image-10219" title="Don't implicitly trust any search engine - use your brain, think, and analyze the results for relevance." src="http://www.booleanblackbelt.com/wp-content/uploads/2011/12/Be_Careful_This_Machine_Has_No_Brain_Use_Your_Own_2.png" alt="" width="235" height="203" /></a></p>
<p>The trouble with search today is that people put too much trust in search engines &#8211; online, resume, social, or otherwise.</p>
<p>I can certainly understand and appreciate why people and companies would want to try and create search engines and solutions that &#8220;do the work for you,&#8221; but unfortunately the &#8220;work&#8221; being referenced here is <em><strong>thinking</strong></em>.</p>
<p>I read an article by Clive Thompson in Wired magazine the other day titled, &#8220;<a title="An interesting little article that takes a look into the issues of trusting search engines and not analyzing the search results - essentially, &quot;putting too much trust in the machine.&quot; Critical thinking should never be removed from any search process!" href="http://www.wired.com/magazine/2011/11/st_thompson_searchresults/">Why Johnny Can&#8217;t Search</a>,&#8221; and the author opens up with the common assumption that young people tend to be tech-savvy.</p>
<p>Interestingly, although <a title="Generation Z (also known as Generation M, the Net Generation, or the Internet Generation) is a common name in the US and other Western nations for the group of people born from the early to mid 1990s to the present.[1][2][3][4][5] The generation has grown up with the World Wide Web, which became increasingly available after 1991[6]. The youngest of the generation were born during a minor fertility boom around the time of the US Global financial crisis of the late 2000s decade, ending around the year 2010, with the next unnamed generation succeeding." href="http://en.wikipedia.org/wiki/Generation_Z">Generation Z</a> is also known as the &#8220;Internet Generation&#8221; and is comprised of &#8220;digital natives,&#8221; they apparently aren&#8217;t very good at online search.</p>
<p>The article cites a few studies, including one in which a group of college students were asked to use Google to look up the answers to a handful of questions. The researchers found that the students tended to rely on the top results.</p>
<p>Then the researchers changed the order of the results for some of the students in the experiment.  More often than not, they still went with the (falsely) top-ranked pages.</p>
<p>The professor who ran the experiment concluded that &#8220;students aren’t assessing information sources on their own merit—they’re putting too much trust in the machine.&#8221;</p>
<p>I believe that the vast majority of people put too much trust in the machine &#8211; whether it be Google, LinkedIn, Monster, or their ATS.</p>
<p>Trusting top search results certainly isn&#8217;t limited to Gen Z &#8211; I believe it is a much more widespread issue, which is only exacerbated by <a title="All is not perfect with intelligent search" href="http://www.submitedge.com/news/intelligent-search/">&#8220;intelligent&#8221; search engines</a> and applications using semantic search and <a title="Natural Language Processing, which began as a branch of Artificial Inteliigence" href="http://en.wikipedia.org/wiki/Natural_language_processing">NLP</a> that lull searchers into the false sense of security that the search engine &#8220;knows&#8221; what they&#8217;re looking for.<span id="more-10211"></span></p>
<h2>This is Your Search Without a Brain</h2>
<p>It&#8217;s easy to see why people and companies create search products and services using semantic search and NLP that claim to be able to make searching &#8220;easier&#8221; &#8211; they are looking to sell a product  based on the value of making your life easier, at least when it comes to finding stuff.</p>
<p>If you take a look at some of the marketing materials for intelligent search and match search products, you&#8217;ll find value propositions such as &#8220;Stop wasting time trying to create difficult and complex Boolean search strings,&#8221; &#8221;Let intelligent search and match applications do the work for you,&#8221; and &#8220;A single query will give you the results you need &#8211; no more re-querying, no more waste of time!&#8221;</p>
<p>I love saving time and getting to what I want faster, but my significant issue with &#8220;intelligent search and match&#8221; applications is that they try to determine what&#8217;s relevant to me.</p>
<p>And that&#8217;s a rather large issue, because only I know what I am looking for.</p>
<p>It&#8217;s critical to be reminded that the <a title="In information science and information retrieval, relevance denotes how well a retrieved document or set of documents meets the information need of the user." href="http://en.wikipedia.org/wiki/Relevance_(information_retrieval)">definition of &#8216;relevance,&#8217; specifically with regard to information science and information retrieval</a>, is &#8220;how well a retrieved document or set of documents meets the information need of the user.&#8221;</p>
<p>The only person that can make the judgment of how well a search result meets their information need is the person conducting the search, because it&#8217;s their specific information need.</p>
<p>Any reference to &#8220;relevance&#8221; by a search engine, whether it be Google, Bing, LinkedIn, Monster, etc., is based purely on the keywords, operators, and/or facets used.</p>
<p>Search engines don&#8217;t know what you want &#8211; they only know what you typed into or selected from the search interface.</p>
<p>Poor use of keywords, operators or facets will don&#8217;t stop you from getting results. All searches &#8220;work,&#8221; as I am fond of saying &#8211; but the quality or relevance will likely be low.</p>
<p>Of course, that assumes that the person conducting the search is actually proficient at judging the quality or the relevance of the results &#8211; comparing results to their specific information need and experimenting with different combinations of keywords, operators and facets to look for changes in relevance.</p>
<h2>Related Does Not Equal Relevant</h2>
<p>I personally never implicitly trust first page or top ranked search results online, nor top ranked results on LinkedIn, Monster, or anywhere I search. Some of the best search results I have ever found were buried deep in result sets &#8211; far past where most people would typically review, and essentially in the territory of results the search engine deemed least &#8220;relevant.&#8221; <a title="Indicating disapproval, irritation, impatience or disbelief." href="http://en.wiktionary.org/wiki/pshaw">Pshaw</a>!</p>
<p>One reason for this is because I understand that any search engine I use, no matter how &#8220;dumb&#8221; (straight keyword matching), or &#8220;intelligent&#8221; (semantic/NLP), they can only work with the terms I give it. What do you think the  most &#8220;intelligent&#8221; search engine can do with poor user input?</p>
<p>When it comes to searching, unfortunately everyone&#8217;s a winner, because every search &#8220;works&#8221; and returns results.  The problem is that few searchers know how to critically examine search results for relevance.</p>
<p>Regardless how how &#8220;intelligent&#8221; a search engine might be, it can only try to find terms and concepts related to my user input.</p>
<p>This is an often overlooked but critical issue &#8211; just because terms might be related, <em><strong>it does not mean they are relevant to my information need</strong></em>.</p>
<p>It certainly helps to understand that some of the most relevant search results can&#8217;t actually be retrieved by the obvious keywords, titles or phrases, or even those that a semantic search algorithm deems related to them. In fact, some of the best results simply cannot be directly retrieved &#8211; see my post on <a title="Most searches only return the tip of the iceberg when it comes to available and truly relevant results." href="http://www.booleanblackbelt.com/2011/03/linkedins-dark-matter-undiscovered-profiles/">Dark Matter</a> for more information on the concept.</p>
<p>However, to appreciate the concept that no single search, no matter how enhanced by technology, can find all of the relevant (by human standards and judgment) results available to be retrieved, you have to know a thing or two about information retrieval in the first place.</p>
<p>And if you already lack the ability to critically judge search results and evaluate them for relevance, how can you be expected to be able to evaluate and critically examine the search results returned by intelligent search and match applications?</p>
<p>The &#8220;<a title="In science and engineering, a black box is a device, system or object which can be viewed solely in terms of its input, output and transfer characteristics without any knowledge of its internal workings, that is, its implementation is &quot;opaque&quot; (black)." href="http://en.wikipedia.org/wiki/Black_box">black box</a>&#8221; matching algorithms of intelligent search and match applications pose significant issues to users in that searchers have absolutely no insight as to <em><strong>why</strong></em> the search engine returns the results it does. Without this, what option does a user have other than to implicitly trust the search engine&#8217;s matching algorithm?</p>
<h2>Searching Ain&#8217;t Easy</h2>
<p>Who says search has to be easy anyway?</p>
<p>Just because you might want it to be, should it be? Does it have to be?</p>
<p>Let&#8217;s face it &#8211; a lot of people look for the easy way out. The sheer volume of advertisements pushing diet supplements that claim you can lose a ton of weight without having to watch what you eat and exercise is evidence that people want to get the results they want without working for them.</p>
<p>You know the best way to lose weight? A healthy diet combined with regular exercise. The problem is that eating healthy and exercising regularly is that it requires discipline and hard work.</p>
<p>I&#8217;m not saying there isn&#8217;t a better way to search &#8211; I am a fan of Thomas Edison&#8217;s belief that &#8220;There is always a better way.&#8221;</p>
<p>However, I believe that the better way, specifically when it comes to information retrieval, involves discipline and the hard work of people using <a title="Critical thinking is the process of thinking that questions assumptions. It is a way of deciding whether a claim is true, false; sometimes true, or partly true. The origins of critical thinking can be traced in Western thought to the Socratic method of Ancient Greece and in the East, to the Buddhist kalama sutta and Abhidharma. Critical thinking is an important component of most professions. It is a part of the education process and is increasingly significant as students progress through university to graduate education, although there is debate among educators about its precise meaning and scope.[1]" href="http://en.wikipedia.org/wiki/Critical_thinking">critical thought</a> in the search process &#8211; not short-cutting or completely removing it from the equation.</p>
<p>And I am not alone.</p>
<p>There is already considerable work being done to create new kinds of search systems that <em><strong>depend on </strong><strong>continuous human control of the search process.</strong></em> It&#8217;s called <a title="Human–computer information retrieval (HCIR) is the study of information retrieval techniques that bring human intelligence into the search process. The fields of human–computer interaction (HCI) and information retrieval (IR) have both developed innovative techniques to address the challenge of navigating complex information spaces, but their insights have often failed to cross disciplinary borders. Human–computer information retrieval has emerged in academic research and industry practice to bring together research in the fields of IR and HCI, in order to create new kinds of search systems that depend on continuous human control of the search process." href="http://en.wikipedia.org/wiki/Human%E2%80%93computer_information_retrieval">Human-Computer Information Retrieval (HCIR)</a> - which is the study of information retrieval techniques that bring human intelligence into the search process.</p>
<p>Truly intelligent search systems should not involve limiting or removing human thought, analysis, and influence from the search process &#8211; in fact, they should and can involve and encourage user influence.</p>
<p>When you break it down, the information retrieval process has 2 basic parts:</p>
<ol>
<li>The user enters a query, which is a formal statement of their information need</li>
<li>The search engine returns results</li>
</ol>
<p>The key, in my opinion, is that the search engine should return results in a &#8221;Is this what you were looking for?&#8221; manner and allow you to intelligently refine your results, as opposed to a &#8220;This <em><strong>is</strong></em> what you were looking for&#8221; manner.</p>
<p>There&#8217;s a BIG difference.</p>
<p>The former begs for user influence and input, the latter does not &#8211; it makes the assumption that it found what you wanted</p>
<p>The bottom line is that no matter what you are using to search for information, only <em><strong>you</strong></em> know what you&#8217;re looking for and therefore judge the relevance of the search results returned.</p>
<p>Intelligent search isn&#8217;t easy, because you actually have to think before and after hitting the search button.</p>
<h2>The Intelligent Search Process</h2>
<p>As I have written before, searching should not be a once-and-done affair &#8211; there is no mythical &#8220;once search to find them all.&#8221;</p>
<p><a title="The real “magic” and work of sourcing talent is via human capital data is the iterative, intelligent, and cognitively challenging process of selecting a combination of words and phrases, and in some cases strategically excluding others, analyzing the results returned, making changes to the query based on observed relevance, and repeating the process until an acceptable quantity of highly qualified and well-matched candidates are identified." href="http://www.booleanblackbelt.com/2011/04/sourcing-is-an-investigative-and-iterative-process/">Searching is ideally an iterative process that requires intelligent user input</a>.</p>
<p>Here is an example of an intelligent, iterative search process applied to sourcing talent:</p>
<ol>
<li>Analyzing, understanding, and interpreting job opening/position requirements</li>
<li>Taking that understanding and intelligently selecting titles, skills, technologies, companies, responsibilities, terms, etc. to include (<em><strong>or purposefully exclude!</strong></em>) in a query employing appropriate Boolean operators and/or facets and query modifiers</li>
<li>Critically reviewing the results of the initial search to assess relevance as well as scanning the results for additional and alternate relevant search terms, phrases, and companies</li>
<li>Based upon the observed relevance of and intel gained from the search results, modifying the search string appropriately and running it again</li>
<li>Repeat steps 3 and 4 until an acceptably large volume of highly relevant results is achieved</li>
</ol>
<p>Anyone can enter search terms and hit the &#8220;search&#8221; button, but not everyone can effectively and intelligently search.</p>
<p>Until you&#8217;ve witnessed intelligent and iterative search in action, you likely wouldn&#8217;t know the difference between &#8220;great&#8221; search results, &#8220;good&#8221; search results and &#8220;bad&#8221; search results.</p>
<p>It&#8217;s as dramatic as the difference between and experienced professional offshore fisher, a recreational fisher, and someone going offshore fishing for the first time.</p>
<p>The ocean holds the same fish for everyone fishing it. While a first-time or recreational fisher can get lucky every once in a while, only a person who really knows what they&#8217;re doing can get &#8220;lucky&#8221; on a consistent basis and catch the fish  the recreational fisher only dreams of catching.</p>
<h2>Final Thoughts</h2>
<p>The ability to enter in some search terms and click the &#8220;search&#8221; button doesn&#8217;t convey any supernatural search ability, but it does certainly make people feel like they are good at searching, because unless you mistype something, everyone&#8217;s a winner.</p>
<p>Ultimately, search engines of all types retrieve information, but information requires analysis, and only humans can analyze and interpret for relevance.</p>
<p>Eiji Toyoda, the former President of Toyota Motor Corp., has observed that “Society has reached the point where one can push a button and immediately be deluged with…information. This is all very convenient, of course, but if one is not careful there is a danger of losing the ability to think.”</p>
<p><a title="Critical thinking has been described as “reasonable reflective thinking focused on deciding what to believe or do.”[2] It has also been described as &quot;thinking about thinking.&quot;[3] It has been described in more detail as &quot;the intellectually disciplined process of actively and skillfully conceptualizing, applying, analyzing, synthesizing, and/or evaluating information gathered from, or generated by, observation, experience, reflection, reasoning, or communication, as a guide to belief and action&quot;[4] More recently, critical thinking has been described as &quot;the process of purposeful, self-regulatory judgment, which uses reasoned consideration to evidence, context, conceptualizations, methods, and criteria.&quot;[5] " href="http://en.wikipedia.org/wiki/Critical_thinking">Critical thinking</a> is perhaps <a title="Critical thinking is the skill most demanded by employers around the world when assessing job candidates, according to organisational and people development consultancy, APM Group." href="http://www.nationmultimedia.com/2011/05/04/business/Importance-of-critical-thinking-30154554.html">the most important skill a knowledge worker can possess</a>.</p>
<p>The reason why so many people stink at search is because most people simply don&#8217;t think before or after they search, and they place too much trust in the machine.</p>
<p>Additionally, the quality of the search terms/info entered directly affects the quality of the results. &#8220;Garbage in = garbage out&#8221; certainly applies here. And effective searching is rarely a &#8220;once and done&#8221; affair &#8211; the ability to critically evaluate search results for relevance and successively refine the search criteria to increase relevance is the key to true &#8220;intelligent search.&#8221;</p>
<p>&#8220;<a title="In science and engineering, a black box is a device, system or object which can be viewed solely in terms of its input, output and transfer characteristics without any knowledge of its internal workings, that is, its implementation is &quot;opaque&quot; (black)." href="http://en.wikipedia.org/wiki/Black_box">Black box</a>&#8221; matching algorithms can be wonders of technology and engineering, but they pose significant problems in that searchers have absolutely no insight as to <em><strong>why</strong></em> they return the results they do, and in many cases, the engineers creating these semantic/NLP matching algorithms assume they know what their users are looking for better than the users themselves. <del>I&#8217;m sorry if I am the only person offended by such an assumption.</del></p>
<p>Okay, I&#8217;m not sorry.</p>
<p>I love technology, and I use and have used some of the best matching technology available, but also I know it&#8217;s not a good idea to try to limit or remove intelligent critical thinking from the search process and completely replace it with matching algorithms.</p>
<p>The term human–computer information retrieval was coined by <a title="Learn more about Gary Marchionini" href="http://www.ils.unc.edu/~march/">Gary Marchionini</a> whose main thesis is that “HCIR aims to empower people to explore large-scale information bases <strong><em>but demands that</em></strong> <strong><em>people also take responsibility for this control by expending cognitive and physical energy</em></strong>.” (emphasis mine)</p>
<p>For those who simply want information systems to magically provide them with the most relevant results at the click of a button, you should take special note of the fact that experts in the field of HCIR do not believe that people should step out of the information retrieval process and let semantic search/NLP algorithms/AI be solely responsible for the search process.</p>
<p>If you want to get better search results, use the latest technologies, but don&#8217;t put too much trust in the machine.</p>
<p>Instead, put some skin in the game, take responsibility for the search process, and expend some cognitive energy critically thinking through not only your search input, but also the results for relevance.</p>
<p>&#8220;In the age of information sciences, the most valuable asset is <a title="Knowledge is a familiarity with someone or something unknown, which can include information, facts, descriptions, or skills acquired through experience or education. It can refer to the theoretical or practical understanding of a subject. It can be implicit (as with practical skill or expertise) or explicit (as with the theoretical understanding of a subject); and it can be more or less formal or systematic.[1] In philosophy, the study of knowledge is called epistemology, and the philosopher Plato famously defined knowledge as &quot;justified true belief.&quot; There is however no single agreed upon definition of knowledge, and there are numerous theories to explain it. Knowledge acquisition involves complex cognitive processes: perception, learning, communication, association and reasoning; while knowledge is also said to be related to the capacity of acknowledgment in human beings.[2]" href="http://en.wikipedia.org/wiki/Knowledge">knowledge</a>, which is a creation of human imagination and creativity. We were among the last to comprehend this truth and we will be paying for this oversight for many years to come.&#8221; — Mikhail Gorbachev, 1990</p>
<h2>Strictly For the Search Geeks</h2>
<p>Check out this <a title="The HCIR 2011 Challenge focuses on the case where recall is everything – namely, the problem of information availability. The information availability problem arises when the seeker faces uncertainty as to whether the information of interest is available at all. Instances of this problem include some of the highest-value information tasks, such as those facing national security and legal/patent professionals, who might spend hours or days searching to determine whether the desired information exists." href="https://sites.google.com/site/hcirworkshop/hcir-2011/challenge">HCIR Challenge</a>, and at least read the  introduction which compares and contrasts precision vs. recall, and references iterative query refinement.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.booleanblackbelt.com/2011/12/why-so-many-people-stink-at-searching/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Sourcers and Recruiters &#8211; Don&#8217;t Fear Watson or Semantic Search</title>
		<link>http://www.booleanblackbelt.com/2011/03/sourcers-and-recruiters-dont-fear-watson-or-semantic-search/</link>
		<comments>http://www.booleanblackbelt.com/2011/03/sourcers-and-recruiters-dont-fear-watson-or-semantic-search/#comments</comments>
		<pubDate>Mon, 28 Mar 2011 13:00:09 +0000</pubDate>
		<dc:creator>Glen Cathey</dc:creator>
				<category><![CDATA[Artificial Intelligence Matching]]></category>
		<category><![CDATA[Semantic Search]]></category>
		<category><![CDATA[Talent Intelligence]]></category>
		<category><![CDATA[Artificial Intelligence Resume Matching]]></category>
		<category><![CDATA[The Future of Recruiting]]></category>
		<category><![CDATA[The Future of Sourcing]]></category>

		<guid isPermaLink="false">http://www.booleanblackbelt.com/?p=8497</guid>
		<description><![CDATA[I&#8217;ve read a few articles recently talking about IBM&#8217;s Watson and how the technology they developed may be the harbinger of unemployment for people in many professions. Here&#8217;s one from Fortune magazine, asking if IBM&#8217;s Watson will put your job in jeopardy. Here&#8217;s another suggesting that those who train others in Internet, social media, ATS, [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2011%2F03%2Fsourcers-and-recruiters-dont-fear-watson-or-semantic-search%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2011%2F03%2Fsourcers-and-recruiters-dont-fear-watson-or-semantic-search%2F&amp;source=GlenCathey&amp;style=compact&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p><a href="http://www.booleanblackbelt.com/wp-content/uploads/2011/03/IBM-Watson21-e1301244151715.jpg"><img class="alignright size-full wp-image-8636" title="IBM Watson wants your job :-)" src="http://www.booleanblackbelt.com/wp-content/uploads/2011/03/IBM-Watson21-e1301244151715.jpg" alt="" width="200" height="181" /></a></p>
<p>I&#8217;ve read a few articles recently talking about IBM&#8217;s Watson and how the technology they developed may be the <a title="one that presages or foreshadows what is to come" href="http://www.merriam-webster.com/dictionary/harbinger">harbinger</a> of unemployment for people in many professions.</p>
<p>Here&#8217;s <a title="Will IBM's Watson be putting you out of a job?" href="http://management.fortune.cnn.com/2011/02/15/will-ibm%E2%80%99s-watson-put-your-job-in-jeopardy/">one</a> from Fortune magazine, <a title="Will IBM's Watson put your job in jeopardy?" href="http://management.fortune.cnn.com/2011/02/15/will-ibm%E2%80%99s-watson-put-your-job-in-jeopardy/">asking if IBM&#8217;s Watson will put your job in jeopardy</a>.</p>
<p>Here&#8217;s <a title="I was not aware that there was a &quot;Boolean Cash Cow&quot; I've certainly never seen one, let alone profited from one. :-)" href="http://www.fistfuloftalent.com/2011/03/21411-the-day-ibms-watson-tapped-out-the-boolean-cash-cow.html">another</a> suggesting that those who train others in Internet, social media, ATS, and resume database sourcing techniques and strategies will be eventually eliminated by semantic search solutions.</p>
<h2>Watson Winning at Jeopardy isn&#8217;t Surprising</h2>
<p>First, let&#8217;s first recognize that it&#8217;s an apples to oranges comparison between Jeopardy and sourcing/recruiting.<span id="more-8497"></span></p>
<p>The ability to quickly research and answer trivia questions (or provide questions for the answers, in the case of Jeopardy) is a far cry from having to boil a hiring need (skills, capabilities, and specific responsibilities in specific industries and environments) down to a series of queries to mine flawed and incomplete human capital data (i.e., resumes and social media profiles) in order to return people who have a high probability of not only being qualified for the position, but also interested in the job (i.e. &#8220;recruitable&#8221;).</p>
<p>With trivia, all of the facts and information are readily accessible, completed and identifiable on the Internet, or in the case of Watson, saved on a multi-TB hard drive array.</p>
<p>It&#8217;s not really shocking that a highly specialized $900,000,000 to $1,800,000,000 (<a title="Watson wasn't cheap!" href="http://money.cnn.com/galleries/2010/technology/1008/gallery.biggest_tech_gambles/3.html?iid=EL">estimated 3 year cost of developing Watson</a>) NLP (Natural Language Processing) computer can sort through 200 million pages of structured and unstructured content, including the full text of Wikipedia, to retrieve information faster than a human relying on memory alone.</p>
<p>Why is anyone surprised that Watson spanked people?</p>
<p>I wasn&#8217;t.</p>
<p>However, I can&#8217;t pass up the opportunity to point out that Watson did make mistakes &#8211; <a title="You can see evidence of the mistake here on Flickr" href="http://www.flickr.com/photos/ken_duffy/5452548946/">here&#8217;s one example in which Watson thought the answer was &#8220;Who is Picasso?&#8221; when the correct answer was &#8220;What is modern art?&#8221;</a></p>
<p>Who knew that the err is human, as well as inhuman?</p>
<p> <img src='http://www.booleanblackbelt.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<h2>The Unique Challenge of Human Capital Data</h2>
<p>Unlike finding the answers to trivia questions, when it comes to finding and identifying qualified and talented people based on their resumes and social media profiles and updates, <strong><em>the information is often incomplete, and in many cases, critical bits of identifying data are simply not present.</em></strong></p>
<p>For example, how do you find someone with <a title="This is a real-world example of a challenge that one of my recruiters is tackling now" href="http://en.wikipedia.org/wiki/Spring_Framework">Spring MVC</a> experience when many people don&#8217;t mention it on their resume, nor on LinkedIn, Twitter, blogs, etc.?</p>
<p>I recently gave the world <a title="When it comes to searching LinkedIn, you don't know what you're missing - literally!" href="http://www.booleanblackbelt.com/2011/03/linkedins-dark-matter-undiscovered-profiles/">a tiny glimpse into the Dark Matter of LinkedIn</a> &#8211; direct keyword, title, and even concept/relational search methods, used by humans or algorithms, can only retrieve results based on existing text.</p>
<p>Quite simply &#8211; if the text isn&#8217;t there to be retrieved or analyzed, a semantic search/NLP algorithm can&#8217;t do anything with it.</p>
<p>Good sourcers really do &#8220;read between the lines&#8221; of both the job description and requirements as well as the human capital data they are searching for and analyzing.</p>
<p>There is <strong><em>much more</em></strong> to high-level sourcing than keyword and title search/match.</p>
<p>There have been semantic solutions on the market for quite some time that can do keyword, title and concept matching reasonably well (as well as some that claim to, but don&#8217;t). The issue with those solutions that no one seems to (or wants to) realize is that they have limitations &#8211; they find some matches, exclude some, and bury others.</p>
<p><strong><em>The real question is who, how, and why are some matches found and ranked highly, while others are excluded, and others ranked lowly but actually represent the best talent?</em></strong></p>
<h2>What Do I Know?</h2>
<p>I have hands-on, practical experience (read: trying to find people to fill real jobs) with many of the &#8220;top shelf&#8221; semantic search applications out there, specifically designed for human capital data, so when I write or speak on the matter of semantic search, I&#8217;m not throwing around empty opinions.</p>
<p>I&#8217;ve seen what these solutions can do, and I&#8217;ve also directly experienced their limitations, including what they simply can&#8217;t do.</p>
<p>Unlike many people who write on the subject of semantic search, I have to personally find people and help others find and recruit talented, qualified candidates with highly specialized skills and experience within 24-48 hours of receiving a client request <strong><em>on a daily basis</em></strong>. If semantic search solutions (including the one I have access to) could speed up that process and help me find more and better candidates faster &#8211; trust me, I would use them!</p>
<p>I&#8217;ve witnessed a sourcer with 9 months of total experience find better qualified matches (and faster) than a big-name semantic search solution in front of one of the senior technical managers responsible for developing the product. It was eye-opening and even somewhat confusing for them, to say the least.</p>
<p>I&#8217;ve also spoken with sourcing/recruiting managers at Fortune 500 companies who have evaluated leading semantic search solutions and they passed on purchasing them because the solutions did not find more and better results faster than their sourcing/recruiting team.</p>
<p>Ultimately, it&#8217;s not about humans vs. technology &#8211; it&#8217;s about results.</p>
<h2>The Solution is Part of the Problem</h2>
<p>I&#8217;ve found the creators of semantic search products don&#8217;t seem to like it nor do they seem to really listen when you point out the flaws and limitations of their creations &#8211; and I&#8217;ve had exchanges with people who hold patents in this space.</p>
<p>I&#8217;ve also gotten the sense from talking with semantic search solutions providers that some of these folks believe that sourcers, recruiters and HR professionals don&#8217;t (and/or can&#8217;t!) really understand semantic search and more complex information retrieval strategies.</p>
<p>To their credit, if their perception (based on experience or otherwise) is that recruiters and HR professionals struggle with Boolean search &#8211; <strong><em>the most basic query &#8220;language&#8221;</em></strong> &#8211; why wouldn&#8217;t they assume that the average recruiter could not possibly understand and appreciate what&#8217;s going on &#8220;under the hood&#8221; of semantic search solutions?</p>
<p>However, it is folly to apply that stereotype to all sourcing, recruiting and HR professionals &#8211; there are plenty of us who actually know more about the specific challenges posed by human capital data and the practical needs and concerns of recruiting organizations than the people who are developing the solutions that we are supposed to intrinsically trust to automatically find the best people available.</p>
<p>I don&#8217;t hate &#8211; I appreciate semantic search. I simply want these solutions to live up to their hype. Semantic search vendors &#8211; listen to your current and potential customers &#8211; they just want your product to work better!</p>
<p>I&#8217;d like to extend an open invitation to any semantic search/NLP vendor &#8211; I will happily evaluate your product and make suggestions for improvements&#8230;for free! If you&#8217;re very confident in your solution, I&#8217;ll also write a review online. If you&#8217;d rather not have your product exposed publicly, I can also evaluate products privately. I really do want to accelerate the efficacy of semantic search applications for sourcing and recruiting!</p>
<p>I also want to educate others who may be buying these kinds of solutions so they are more knowledgeable and informed as to the pros and cons, capabilities and limitations of these solutions, and not sold simply on impressive sales pitches, techno-speak and &#8220;see how many results?&#8221; demonstrations. If you&#8217;re a potential customer of semantic search solutions, please be sure to include your best sourcers/recruiters in the evaluation process &#8211; if the only people who are evaluating a semantic search solution are HR, management, and procurement professionals who don&#8217;t actually search for top talent on a daily basis and won&#8217;t be using the proposed solution, you can easily be sold on a product that doesn&#8217;t actually work as well as you might think based on the sales presentation.</p>
<p>If you&#8217;re looking to buy a new flat screen TV or car, <strong><em>anyone</em></strong> can read reviews online, test drive them and compare them to competing products.  I find it interesting (and telling!) that you can&#8217;t do the same thing when it comes to recruiting and HR software.</p>
<p>When you buy a house &#8211; you get it inspected by a specialized professional before you buy it so you really know what you&#8217;re getting beneath the surface. Before you buy a semantic search solution, you should have it evaluated by a person who specializes in human capital information retrieval (who is also ideally a neutral third party!).</p>
]]></content:encoded>
			<wfw:commentRss>http://www.booleanblackbelt.com/2011/03/sourcers-and-recruiters-dont-fear-watson-or-semantic-search/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
		<item>
		<title>Curious About My SourceCon Keynote?</title>
		<link>http://www.booleanblackbelt.com/2010/03/curious-about-my-sourcecon-keynote/</link>
		<comments>http://www.booleanblackbelt.com/2010/03/curious-about-my-sourcecon-keynote/#comments</comments>
		<pubDate>Mon, 08 Mar 2010 16:00:41 +0000</pubDate>
		<dc:creator>Glen Cathey</dc:creator>
				<category><![CDATA[Artificial Intelligence Matching]]></category>
		<category><![CDATA[Recruiting Technology]]></category>
		<category><![CDATA[Resume Sourcing]]></category>
		<category><![CDATA[SourceCon]]></category>
		<category><![CDATA[Sourcing Automation]]></category>
		<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Semantic Search]]></category>
		<category><![CDATA[Sourcing]]></category>

		<guid isPermaLink="false">http://www.booleanblackbelt.com/?p=5056</guid>
		<description><![CDATA[Are you attending or thinking about attending SourceCon 2010 in San Diego in March? I am going to be the keynote speaker for the event, and I will be presenting on Artificial Intelligence vs. Human Cognition when it comes to sourcing and matching resumes. If you’re curious to know what kinds of things I’ll be [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2010%2F03%2Fcurious-about-my-sourcecon-keynote%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.booleanblackbelt.com%2F2010%2F03%2Fcurious-about-my-sourcecon-keynote%2F&amp;source=GlenCathey&amp;style=compact&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p><img class="alignright size-full wp-image-5060" title="SourceCon2010_GlenCathey_250x250" src="http://www.booleanblackbelt.com/wp-content/uploads/2010/03/SourceCon2010_GlenCathey_250x250.gif" alt="SourceCon2010_GlenCathey_250x250" width="250" height="250" />Are you attending or thinking about attending SourceCon 2010 in San Diego in March?</p>
<p>I am going to be the keynote speaker for the event, and I will be presenting on Artificial Intelligence vs. Human Cognition when it comes to sourcing and matching resumes.</p>
<p>If you’re curious to know what kinds of things I’ll be addressing during the session, here is a sneak peek:</p>
<ol>
<li>The intrinsic and often overlooked challenges associated with sourcing resumes</li>
<li>What artificially intelligent semantic search and match applications claim to do and how they actually work</li>
<li>The limits of artificial intelligence</li>
<li>What people can do that semantic search applications cannot</li>
<li>The 5 levels of semantic search</li>
<li>The 5 levels of secondary/e-sourcing</li>
<li>What I believe would be the ideal candidate sourcing/talent identification solution<span id="more-5056"></span></li>
</ol>
<p>If you’ve ever wondered about the fantastic claims that some of the semantic search application vendors on the market make as to how their solution can mimic a senior recruiter when finding candidates, then you will be very interested in hearing what I have to say about the reality of what they can do.</p>
<p>If you’re a sourcer and you’re concerned that your role/position might eventually be replaced by sourcing software, you will be encouraged by my analysis and supporting arguments that explain why the abilities of creative and investigative sourcers will always be in demand – tomorrow and 50 years from now.</p>
<p>I hope you will be able to attend SourceCon 2010 – I know I’m looking forward to it!</p>
<p>If you&#8217;re unable to attend, the good news is that the presentations will likely be streamed. Additionally, I plan on posting my expanded slide deck, including all talking points &#8211; so you won&#8217;t be stuck staring at some pretty pictures wondering what the heck I talked about. <img src='http://www.booleanblackbelt.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://www.booleanblackbelt.com/2010/03/curious-about-my-sourcecon-keynote/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>

