Monday, March 09, 2009

Searching Search Like a Yandex

Let me introduce Yandex. It's an interesting search engine because it precedes Google. In fact, Yandex was founded in the late 1980s, before the advent of the Web. What is interesting is that Yandex is a classic case study that Google is not the end all and be all of search. Google may be good in English, but how does it fare in multilingual searching. (Remember: English is only a fraction of the Internet's languages).

What is interesting is that Yandex's search algorithm is rooted in the highly inflected and very peculiar Russian language. Words can take on some 20 different endings to indicate their relationship to one another. Like the many other non-English languages, this inflection makes the language of Russian precise, but makes search extremely difficult. Google fetches the exact word combination you enter into the search bar, leaving out the slightly different forms that mean similar things. However, Yandex is unique in that it does catch the inflection. Fortune has written an interesting article on Yandex, and my favourite part is its examination into the unique features of this Russian search giant:

While some of its services are similar to offerings available in the U.S. (blog rankings, online banking), it also has developed some applications that only Russians can enjoy, such as an image search engine that eliminates repeated images, a portrait filter that ferrets out faces in an image search, and a real-time traffic report that taps into users' roving cellphone signals to monitor how quickly people are moving through crowded roads in more than a dozen Russian cities.



Thursday, March 05, 2009

BBC's Semantic Web

BBC gets it.   In the latest issue of Nodalities magazine (one of my favourite reads), BBC reveals how it is applying the bottom-up approach to its contribution in realizing the SemWeb.   To make this happen, web programmers broke with BBC tradition by designing from the domain modelup rather than the interface down.  The domain model provided us with a set of objects (brands, series, episodes, versions, ondemands, broadcasts, etc) and their sometimes tangled interrelationships.

This is exciting stuff.  Without ever explicitly talking RDF we’d built a site that complied with Tim Berners-Lee’s four principles for Linked Data:

(1)  Use URIs as names for things. 

(2)  Use HTTP URIs so that people can look up those names. - 

(3)  When someone looks up a URI, provide useful information

(4)  Include links to other URIs

In fact, as the BBC web developers argue, 
considering how best to build websites we’d recommend you throw out the Photoshop and embrace Domain Driven Design and the Linked Data approach every time. Even if you never intend to publish RDF it just works.   The longer term aim of this work is to not only expose BBC data but to ensure that it is contextually linked to the wider web.  
The idea is to free web of data.

BBC Gets It.

Monday, February 23, 2009

Shame on You Wall Street Journal

It is regrettable indeed.   I was deeply saddened and somewhat enraged by the Wall Street Journal's closing of its library.   In our information age, that depends so much on knowledge workers, Wall Street has decided that it could cut back taking away a vital piece of information news gathering, organizing, and dissemination of up-to-minute information.   Can news reporters expect to do all the work themselves?  Can they properly search for relevant and pertinent information? Is that even their jobs?  

Could we inset librarians and information professionals into the jobs of news journalists?   Of course not.  Wall Street - give your head a shake.   A knowledge centre, particularly in a top-notch industrial media giant such as Wall Street, requires expert searchers.    When asked, a spokesperson responds,

It is regrettable. Our reporters do have access to multiple databases including Factiva and this migration to digital databases as you has been happening for many years.

Sure.  Good luck with having your reporters spend up to ten times the amount of time it would take to find information a trained information professional could obtain for you in a fraction of the time.  A librarian is like the glue that holds the house together.  You can only go so far and so long without a librarian's information retrieval skills before the infrastructure cracks and crumbles.   Particularly in our emergine Web 2.0 world of social media and open access resources, can a company survive alone without expert information and knowledge management?  Best of luck Wall Street Journal.  

Saturday, February 21, 2009

Video Sharing for Librarians




I recently presented at TOTS. What is video sharing Whyshould we care? How can be of use for information professionals? What are some issues for us to consider? Let's take a look together.

Monday, February 16, 2009

Who Video Shares? Barack Obama Does!




Who uses Web 2.0 to its fullest capacity? Barack Obama does. The President posts regularly to Vimeo. Vimeo is different in that it offers High-definition content. On October 17, 2007, Vimeo announced support for High Definition playback in 1280x720 (720p), becoming the first video sharing site to support consumer HD.

Wednesday, February 11, 2009

Mashups at PSP 2009


View more presentations from Allan Cho.
I had recently given a presentation as part of a panel at the Association of American Publisher's Professional Scholarly Publishing (PSP) 2009 joint-pre-conference with the National Library of Medicine, titled "MashUp at the Library Managing Colliding User Needs, Technologies, and the Ability to Deliver."    Here are the slides I had used - any comments most appreciated.

Thursday, January 29, 2009

Is Youtube The New Search?

Information professionals everywhere take note: Google is uncomfortably sliding. Gone are the days that we 'google' for information. And now YouTube, conceived as a video hosting and sharing site, has become a bona fide search tool. Searches on it in the United States recently edged out those on Yahoo, which had long been the No. 2 search engine, behind Google. Interesting that Google owns YouTube, isn't it? In November, Americans conducted nearly 2.8 billion searches on YouTube, about 200 million more than on Yahoo, according to comScore. Here is what one 9 year old reveals about his information search behaviour in a New York Times article:

I found some videos that gave me pretty good information about how it mates, how it survives, what it eats,” Tyler said. Similarly, when Tyler gets stuck on one of his favorite games on the Wii, he searches YouTube for tips on how to move forward. And when he wants to explore the ins and outs of collecting Bakugan Battle Brawlers cards, which are linked to a Japanese anime television series, he goes to YouTube again. . .

“When they don’t have really good results on YouTube, then I use Google."

What does this mean? Are Facebook, Youtube, and Twitter going to take down the venerable goliath Google? Not really. I argued in an article that this is the phenomenon of social search. Are things finally catching up?

Monday, January 26, 2009

Ushahidi as a Mashup



I'm going to be talking soon about mashups. (And getting nervous about it, too). One mashup that I will be discussing is Ushahidi. It's an excellent example of how Web 2.0 is saving lives. Using technology to harness peace. More to come. Here is an excellent slide show of Ushahidi.

Wednesday, January 21, 2009

Nova on the Future of the Web

I heart Nova Spivak.  The grandson of management professor Peter Drucker, Spivak is an intellectual in his own right.  Not only is he a semantic web pioneer and technology visionary, he's also founded Twine, one of the first semantic web services out there.   I think he's one of the brightest minds today regarding ideas about the future of the Web. He's a visionary.   Here's a synopsis of Spivak's treatise Future of the Desktop.

1) The desktop of the future is going to be a hosted web service

(2) The Browser is Going to Swallow Up the Desktop

(3) The focus of the desktop will shift from information to attention

(4) Users are going to shift from acting as librarians to acting as daytraders

(5) The Webtop will be more social and will leverage and integrate collective intelligence

(6) The desktop of the future is going to have powerful semantic search and social search capabilities built-in

(7) Interactive shared spaces will replace folders

(8) The Portable Desktop

(9) The Smart Desktop

(10) Federated, open policies and permissions

(11) The personal cloud

(12) The WebOS (Web operating system)

(13) Who is most likely to own the future desktop?

Saturday, January 17, 2009

Topic Maps and the SemWeb

Half a year ago, I had a posting discussing Katherine Adams' seminal article about librarians and the Semweb. Katherine made a point about Topic Maps, which she believes will ultimately point the way to the next stage of the Web's development. They represent a new international standard (ISO 13250). In fact, even the OCLC is looking to topic maps in its Dublin Core Initiative to organize the Web by subject.

In the same posting, Steve Pepper, an independent researcher, writer and lecturer who has worked with open standards for structured information for over two decades, made a very interesting comment. He argues that:

Topic maps is really spearheading is nothing short of a paradigm shift in computing -- the notion of subject-centric computing -- which will affect far more than just the Web.

We've let programs, applications, and even documents occupy centre-stage for far too long. This is topsy-turvy: users are primarily interested in subjects (what the information is about), not how it was created or where it lives. We need to recognize this, and effect the same kind of change in information management that object-orientation effected in programming; hence the need for a subject-centric revolution.
Indeed, the Topic Maps 2008 Conference in Oslo, Norway, April 2-4 has just concluded. So what are topic maps, and why are they relevant for libraries and information organizations? The basic idea is simple: the organizing principle of information should not be where it lives or how it was created, but what it is about. Organize information by subject and it will be easier to integrate, reuse and share – and (not least) easier for users to find. The increased awareness of the importance of metadata and ontologies, the popularity of tagging, and a growing interest in semantic interoperability are part and parcel of the new trend towards subject-centric computing.

This conference brings together these disparate threads by focusing on an open international standard that is subject-centric to its very core: ISO 13250 Topic Maps, which is interestingly what Katherine Adams had pointed out eight years ago. We're getting closer. The pieces are in place. We just need a good evening to frame together the picture.

Monday, January 12, 2009

hakia and Librarians' Race to End the Search Wars

I've always been intrigued by hakia, which is considered the first SemWeb search engine of its kind. It is said that for the next generation web to exist, there needs to be a more concise way for users to find information and to search the web online. hakia is working with librarians to help make its results even more credible in the attempt to win the race to ouster Google in the current search engine wars. hakia is one of the first Semantic Web search engines.

However, besides QDEX (Quality Detection and Extraction) technology, which indexes the Web using SemanticRank algorithm, a solution mix from the disciplines of ontological semantics, fuzzy logic, computational linguistics, and mathematics, hakia also relies on the subject knowledge expertise of professionals. By combining technology and human expertise, it attempts to completely redefine the search process and experience. Take a look at my hakia, Search Engines, and Librarians How Expert Searchers Are Building the Next Generation Web for a deeper analysis of what hakia is trying to do with librarians. Hopefully, it offers more food for thought.

Thursday, January 08, 2009

A New Web 2.0 Journal

Web 2.0 The Magazine: A Journal for Exploring New Internet Frontiers is an important new journal that librarians and information professionals should take a serious look at. It attempts to fill the information gap in the area of Web 2.0 by focusing on new developments, the most used tools, trends, and reviews of books, articles, sites, and systems themselves so as to make Web 2.0 a useful part of the reader’s technology experience. Here is what Web 2.0 The Magazine attempts to do:

Admittedly, Web 2.0 is a hard concept to get one’s arms totally around as it means anything involving “user content”. This broad definition covers everything from social networks, such as Facebook, to 3D Virtual Reality Worlds, such as Second Life and World of Warcraft, with many, many stops in between. The unifying feature in all of the Web 2.0 systems and tools is that they differ fundamentally from Web 1.0, which is a one-way connection, in which information sources, vendors, advertisers, etc. present information for the reader to consume and / or respond to (the fact that a user may choose to buy on-line from Amazon or Sears does not make those sites something other than Web 1.0 since the user was not the one to initiate the content).

Thanks Dean for recommending this journal to me. It's an excellent read so far.

Wednesday, December 31, 2008

Stephen Abram and the World of Libraries

Stephen Abram is a smart guy. The first time I heard him speak was at the 2008 SLA Conference in Seattle. He was brilliant, to say the least. Abram's almost everywhere you turn your head, he is a workaholic to the nth degree. Abram is a innovative librarian who invests his energies in technology and trend forecasting.. Abram also has more than 25 years in libraries as a practicing librarian and in the information industry. In other words, I trust this guy.

Abram has also been highly acclaimed with numerous awards and leadership positions. He was named by Library Journal in 2002 as one of the key people who are influencing the future of libraries and librarianship. Served as President of both the Canadian Library Association (CLA) and Special Libraries Association (SLA). Here is a candid interview that Abram gave a year ago. He reveals he had to apply twice to get into library school, and how he learned the craft of public speaking.



Tuesday, December 30, 2008

Seasons Greetings

Seasons greetings everyone. This holidays, as you are enjoying your Christmas at home, please take some time in considering contributing to a worthwhile campaign. The campaign is Books for Darfur Refugees -- which give $1 for each book received as a holiday gift -- the website is: http://holiday.bookwish.org/.

Books for Darfur Refugees certainly appreciate your helping to spread the word, too. It is a 100% volunteer staffed; 100% of funds raised by this campaign for direct book related aid for Darfur refugees. The good news story here is the inspiration of Darfuris who self-organized their own English classes in refugee camps. For example, they view learning English as their "road to freedom."

Since sending two shipments of specifically requested ESL books to the camps in May 2008, the numbers of refugees learning English has jumped from 400 to 800 (as of July 2008) and now numbers more than 1,100! We are partnering with the British NGO, CORD, that runs education programs for UNHCR and UNICEF in the Bredjing, Treguine, and Gaga camps (60,000 refugees, about 20,000 students, about 1/2 are girls).
The website (http://www.bookwish.org/) shows inspiring photos of the refugees smiling and holding up the ESL books that were sent to them. Happy New Year everyone.

Monday, December 22, 2008

Professor Jerry Newman on Management

A while back ago, I had written on a book I recommended as a must read for those interested in management techniques and the ways in which people interact in a fast-paced workplace The book, My Life on the McJob, explores this, as Jerry Newman, a management professor at Professor at SUNY Buffalo, decided to conduct an experiment as he worked at seven stores over 14 months – two McDonalds, two Burger Kings, one Wendy’s, one Arby’s and one Krystal (a fast food chain in the South), with the stores being located all over America, in Michigan, Florida and New York. Newman worked all jobs, grill, register, custodian, and observed and documented experiences first-hand.

Newman's case study was so fascinating that I wanted to interview him and ask him more about his book and whether it applied to libraries, which often resembles a retail fast-food chain in terms of frenetic pace with customers and rigid tension between management and staff. Here is our interview:

Question: Libraries are every bit as dysfunctional as any organization. What can libraries learn from McJob? Is your book written for fastfood and retail only? Does it apply for all?

Newman: The book is relevant to any organization that has multiple shifts in the course of a day, or that has multiple units within the organization. I think libraries qualify on both accounts. The biggest problem in multi-unit operations, and this isn't just me speaking - mcd agrees with this - is the inconsistency across time and units. To be great, first you must be consistent. This isn't always "sexy", hence the reason for low interest.

Question: What can managers learn from your book? If there is one thing they can take away from your book, what would it be?
Newman:
  • Fast food jobs are HARD – both physically and mentally
  • These jobs provide opportunity to learn important life skills
    • Dealing with pressure situations
    • Communicating with peers
    • Managing conflict (with customers, peers)
  • Fast food is more representative of our country’s diversity and makeup than other industries
  • MOST INTERESTING: The store’s manager (and not corporate operations procedures and values) determines the climate and ultimately the success of the workplace

Question: What works? You had mentioned the four four R’s. What are they?

Newman:
  • Realism…People like predictability, set boundaries and expectations
  • Recognition…Be an ego-architect – reinforce self worth
  • Relationships…Build a social web, identify those employees that connect with others and use them to cultivate camaraderie among the troops
  • Rewards…Gold stars still work

Question: What were some challenges you found?

Newman: How to reward your employees when money is not an option.

  • Provide constructive feedback: Gold stars worked in elementary school, still work now
  • Recognize job proficiency by make an example of a strong employee
  • Offer flexible hours and job security
  • Facilitate social interaction – build a social web, make the work-place a fun-place to be
  • Advertise opportunities to advance
  • Build positive manager/employee relationships

Question: What are some key takeaways from your research in this book?

Newman:

  • Hiring decisions are key to store success and employee retention
  • Culture has the strongest impact on workers’ behavior – and managers are in control
  • Camaraderie and strong work ethic are a winning combination

Question: Were there any surprises during the extent of your experiences?

Newman:

  • Fast food is not an easy job
  • No forum for employee feedback and unsolicited feedback on operations/best practices is not welcome
  • Wide disparities exist across stores – even those with the same name
  • Women are better managers
  • Recognition is a powerful motivator