Thursday, June 22, 2006

The Never Ending Wonders of Google

I followed up on Google Scholar with an article I came across by James Caufield, Where did Google Get Its Value, which is not only fascinating and thought-provoking, but also forced me to think about the search engine within the library domain: not as a complement to the electronic and hardcopy collections, but with the library ultimately as a "predecessor" of Google. Whether Google's success is built on the shoulders of librarianship is certainly up for debate (one for which I'd like to reserve front-row seats), but it does offer food for thought in how we use and perceive Google and search engines. Here is what Caufield says are the strengths and weaknesses of Google:

Strengths
(1) Better Indexing – As Caufield argues, “Google brings a library value to the web environment [by] improved access through better Indexing.” I'm certainly much more enlightened now on the mechanics of how Google works. (So that's what separates Google from the rest. . . ). Indeed, while other search engines blindly rank the relevance/importance of a website based on the number of key terms that the website contains, Google uses a unique algorithm that ranks webpages based on the links that point to them from other relevant websites. In many ways, this is almost as if the internet is "peer-reviewed," and relevance is constantly upheld by other websites. Whereas websites can get away on other search engines with simply padding their pages with any key terms to cover a wide spectrum of subjects, Google restricts this practice. If you highlight the rest of this line, you'll know what I mean. (See, you use the entire dictionary just to fill up entire pages with words! How ingenious yet devious!)

(2) Better Access through Simple and Disinterested User Interface - Instead of urging the user to stay within the same page for the purposes of advertising and data collection, Google erased this questionable practice when it introduced its plain and simple search engine box. What this did was improve access and allow the user to obtain information in a much more timely fashion. (In other words, it made things quick and tidy).

(3) Google Brings the Library Value of Unbiased Selection to the Web Environment - Unlike other web search engines, Google didn't accept payment for placement in its search results. (Hence, the simple user interface). What this meant was that content from its searches was incorruptible, since all materials were on equal footing -- one didn't and couldn't pay money to get material slotted to a higher ranking.

(4) Google Produces Better Access through Uncorrupted Indexing - With some controversy, Google also "punishes" websites that try to manipulate and break the Google PageRank algorithm. So-called Search Engine Optimization plays by Google's rules and increases the ranking of webpages by creating inbound links. However, Google counters such moves by adjusting its own algorithms to offset the tactics of the culprits. But in the end, who manipulates whom? And at what cost? As the Machiavellian conundrum goes, "Does the end justify the means?"

(5) The Reference Interview - This one caught me off guard. However, Google apparently processes a reference transaction very much like a librarian by having cookies attached to every user. By having a history of the searches made by a user, Google has a better and more focused understanding of the user's need.
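
To make the contrast in point (1) a little more concrete, here is a rough Python sketch that compares naive keyword counting with a simplified link-based score, loosely in the spirit of PageRank. It is only an illustration under my own assumptions, not Google's actual algorithm: the tiny "web" of four pages, the names pagerank and keyword_score, the damping value of 0.85 and the iteration count are all made up for the example.

```python
def keyword_score(text, term):
    """Naive relevance: count how often the term appears on the page."""
    return text.lower().split().count(term.lower())


def pagerank(links, damping=0.85, iterations=50):
    """Simplified link-based score computed by repeated redistribution.

    links maps each page name to the list of pages it links to.
    The damping factor and iteration count are illustrative assumptions.
    """
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}

    for _ in range(iterations):
        # Every page starts each round with a small baseline score.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page passes its score on, split evenly among its outbound links.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no outbound links spreads its score over all pages.
                share = damping * rank[page] / n
                for other in pages:
                    new_rank[other] += share
        rank = new_rank
    return rank


# Hypothetical four-page web: "spam" is stuffed with key terms, but nobody links to it.
web_links = {
    "library": ["journal"],
    "journal": ["library"],
    "blog": ["library", "journal"],
    "spam": ["library"],
}
web_text = {
    "library": "mental health resources",
    "journal": "primary care research",
    "blog": "notes on health care reform",
    "spam": "health health health health",
}

if __name__ == "__main__":
    ranks = pagerank(web_links)
    for page in sorted(ranks, key=ranks.get, reverse=True):
        print(page, round(ranks[page], 3), keyword_score(web_text[page], "health"))
```

In this toy example the keyword-stuffed page wins on raw term counts but ends up with the lowest link-based score, since no other page "vouches" for it.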

Negatives
(1) Privacy! - Not surprisingly, with cookies come problems of privacy. Indeed, one can see how Google can keep track of search histories simply by having it up on the RSS feeds. It's widely available for anyone to see his or her own searches. Which is nice if one is comfortable with it; however, if users share the same computers (or are unaware that they do), then privacy issues can certainly emerge.

Yahoo!
Interestingly, I was not entirely aware of the conveniences of Google until quite recently. For years, I have preferred Yahoo.com due to its familiarity (it's been around since 1994). While countless search engines have come and gone, Yahoo! has always stayed faithfully by my side throughout my online experiences. Moreover, not only has it been quite consistent with its user interface throughout its existence, it has a handy indexing system that separates different subjects for the user (not that anyone really uses it. . . ) But nonetheless, Google is the preferred choice of most users, and it seems as if it is here to stay for quite a while. Which leads me to a short anecdote. During an interview, the librarian asked me what search engine I prefer using. Naturally (and naively) I stood up for Yahoo! However, my reasoning was illogical in that I admitted that I liked it for nostalgic purposes. Hindsight is always 20/20, but if I had another chance. . .

Wednesday, June 21, 2006

Google Scholar

As fodder for future discussion, BMB Librarian Dean Giustini suggested that I read Neuhaus, et al.’s “The Depth and Breadth of Google Scholar: An Empirical Study.” The authors ask a series of questions: (1) How often is this database updated? (2) Does Google Scholar have particular disciplinary strengths and weaknesses? (3) How does the content of Google Scholar compare with that of other databases? To answer these, the study compared the contents of 47 databases to Google Scholar. Here are the points which I found most germane to the discussion.

Weaknesses
(1) Time – Despite its vast net of coverage, the drawback of Google Scholar is that it is slow. As the authors discover in one example, there was at least a three-month time lag before information that appeared in BioMed Central showed up in Google Scholar.

(2) Subject coverage – Google Scholar’s “shortcoming” is that its coverage is biased towards the sciences, while its coverage of databases in other areas is somewhat poor. Thus, for a researcher in the humanities, education, business, or social sciences, using Google Scholar might not be as advantageous.

(3) No content collection statement – If Google Scholar had a detailed description of its content collection methodology, it would certainly allow users greater insight into the capabilities and limitations of the inner workings of Google Scholar. Because it doesn’t, researchers are basically grasping at straws.

Strengths
(1) Coverage - Despite its shortcomings on certain subjects, Google Scholar nonetheless has an expansive range of open access journals, freely accessible databases, and single publisher databases. The core strength of what Google Scholar offers is free content. Not just any free content, but scholarly-type information (both books and journals), which is immensely useful for information-seekers.

Interestingly, based on the work that I am currently doing at the SFU Faculty of Health Sciences’ Centre for Applied Research on Mental Health and Addictions, I have had the serendipitous opportunity to do a lot of research from a user's point of view, particularly with health-related topics and issues.

My supervisor, Matthew Queree, a Researcher at SFU's Faculty of Health Sciences who is particularly adept and quite experienced in mental health and addictions-related research in the areas of psychiatry and psychology, is an avid supporter of Google and particularly Google Scholar, arguing that they have revolutionized the way that research is done by scholars. With his enthusiastic backing, and insight from a "user’s" perspective, I was curious to do my own comparisons.

Methodology:
Key words used: (1) mental health; (2) primary care; (3) health care reform

Thoughts and Reflections:
(1) Commercial Websites – Although there is a wide assortment of sources available on Google Scholar, I noticed that a great deal of these are "commercial" sites, and not articles like those I'd find in a journal. Thus, there is still a "search engine" element that slips through in Google Scholar, which may or may not be of benefit to the user. (It all depends on the objective of the user, I suppose...)

(2) Search Results – If not carefully limited, the search results can run into the tens of thousands of hits. As a result, it can be a rather frustrating experience for the user to sort through the diverse array of materials. For example, what is most frustrating about Google Scholar (based on my own experimenting) is the lack of chronological ordering. Articles that date as far back as the 1970s can be found together with recent materials from the 2000s. This results in hodge-podge combinations of results which the user must sort through himself or herself in the end. In most electronic databases (such as PsycINFO), results can be sorted chronologically, thus allowing the user to find materials which are most current.

(3) Bibliographic Control - As the scholar Patrick Wilson said, the ultimate bibliographic instrument is one that can procure the "best textual means" to one's ends (i.e. finding as much information in as little time and with as little effort as possible). If such is the case, then Google Scholar is only half-successful. However, as Neuhaus et al. argue, without a clear collection statement, it is somewhat teleological to argue that Google Scholar comes up short when compared to electronic databases. Perhaps that is not the primary function of Google Scholar; perhaps it exists to serve other purposes. If so, I'd like to know!

Sunday, June 18, 2006

The Travails of Technology


I'd like to blame the past few years of my secluded existence of research in the basements of libraries as the reason for my lack of current tech-savviness. However, the reality is that I have simply neglected the world of technology and it has hurt me. I admit that I feel as if I am left in the dark, and have tried to cover up my ignorance with self-assurances that I'd catch up (it can't be that hard, right?).

Interestingly, my present situation is reminiscent of the period before I bought my first computer. The world had passed me by, and I realized I was clueless about computers. I scrambled to catch up, learning everything from DOS programming to breaking apart and piecing back together PC hardware. I religiously read all the latest PC literature (PC Gamer was my favourite), shopped frequently at Dopplers (that short-lived but legendary predecessor of "Futureshop") and kept up to date on all things computer-related. (I was even briefly an audio/stereophile during my studies in Electronics 11 & 12). The point is, ever since I bought my first PC, an IBM 486SX 33 MHz, I was proudly confident that I'd never fall behind in technology again.

It wasn't until I started my LIS program that I realized just how far I had fallen behind. The farther I traversed in the technological wilderness, the greater the appreciation I have for information specialists. While much of their education focuses on the basics of information management, much of the "real" learning is outside of the classroom, particularly with technology, which changes at blazing speeds. If Thomas Friedman's The World Is Flat represents the analysis of globalization in the early 21st century, I'd like to present my own views of the early 21st century: in particular, technology.
While Friedman has ten forces that flattened the world, I have 10 forces (plus 1) which I believe have characterized technology in the early 21st century.

(1) Flashdrive - I must admit, I didn't buy a flashdrive until only very recently, when I really needed one. However, the reality is, I didn't even know (or care) about the existence of flashdrives until only very recently. Until now, I have still been walking in the shadows of the 3.5-inch floppy disk and 700 MB CD-RW (and also email). I never really considered any alternatives to these technologies. But the question remains, how long will flash memory linger before the next technology emerges to succeed it? Is the USB Flashdrive simply another cashgrab convenience for PC manufacturers? Or is it a necessary technology?

(2) RSS - Really Simple Syndication (or Rich Site Summary) is a form of web syndication used by news websites and weblogs. The first time I heard of it and really paid attention to its existence was in an LIS class, when someone made a facetious comment about putting an RSS feed on his webpage. I had no idea what the student meant, and at that moment, I was a bit worried whether it was something I shouldn't be ignorant about. And the more I read, the more RSS comes up in everyday tech-speak. I'm glad to say that I have finally set up my own RSS feeds to My Yahoo! and am maximizing its uses. However, the question is, can RSS be considered a novel technology? Or is it merely an updated form of "Bookmarks"? When I first encountered RSS, I smirked at its simplicity. Is it just a lazier method for surfers to bookmark all of their favourite links to one page? Can one not just simply click on a bookmarked link, and find the appropriate information himself or herself?

(3) iPod - I must be one of the few people left who does not have an iPod - at least it feels like it. It seems as if people everywhere are carrying these tiny multicoloured machines. I am certain that there will be a day that I buy an iPod, since it's "probably" a more convenient way to listen to music and also because I burn all my current CDs from MP3s. Yet at the back of my mind, I question whether the iPod is any different from a CD Discman. I've grown to love the feel of taking out and putting back in a physical entity; until I get used to it, listening to mere digital bytes feels somewhat "un-audio" (perhaps reminiscent of the audiophile who still cherishes the vinyl). Which leads me to the question: just how many listening devices will I need to buy for other people's birthdays and Christmases? How long will the MP3 player last among the long line of deceased products (i.e. the record, 8-track, audio cassette, CD)?

(4) Google - I am embarrassed to admit that I never thought too much of "Google" until this year. It's been around since 1998. However, I've always favoured Yahoo.com as my preferred search engine, firstly because of nostalgic reasons (it was the "first" popular search engine), and secondly because of ignorance. Up until this fall, I still regarded Yahoo as my preferred destination for online information retrieval. In fact, I never really thought too kindly of the overly simple interface of Google. (It's just a lazy blank screen around a search box! Until I realized that was its main purpose. . .) However, my question is, can Google last? Is it more than just a search engine for information retrieval? Or is it a multimedia corporation? It seems that it is leaning towards the latter. However, I have seen the rises and falls of the AltaVistas and Lycoses; only time can tell whether Google will follow suit. I remember I was an avid proponent of Northernlights.com, which for a while was ranked as the #1 search engine by many. Now, it's sadly lost its original domain www.northernlights.com.

(5) IP Address - An Internet Protocol Address is a unique number that devices use in order to identify and communicate with each other on a computer network. In the complex world of the World Wide Web, it is really the only piece of information that uniquely distinguishes one computer from another. If used properly, it can help authorities solve online crimes simply by tracking down the user's IP address. With this said, I am still puzzled by its exact nature. What I do know is that it can be a user's worst nightmare if handled carelessly, such as when adding a wireless router to an existing network.

(6) Blogs - They are websites ("weblogs") where regular entries are made (such as in a journal or diary) and presented in reverse chronological order. Although I am a huge proponent of blogs, it was not until very recently that I opened an account and realized their exceptional usefulness. However, with this said, I sometimes question the popularity of the blog, and also wonder how long it will endure. I first got introduced to the world of blogs, and really learned its nuances, when I completed an assignment on blogs. The more I read up on their history and functions, and the more I blog, the greater the deja vu feel that I have of Geocities, Xoom.com, and Angelfire.com, which were free webhosting services ever so popular in the late 1990s and early 2000s. I remember fondly my high school and undergraduate days when I would post most of my thoughts online, not unlike what most bloggers do nowadays as well. The popularity of those services has died down considerably (along with Yahoo, incidentally). Which leads me to my next point: are blogs simply an updated version of Geocities in a simplified form, minus the basic programming and graphics options? Or are they the start of something new, of how communication will eventually evolve among the online community? Of the new Internet 2.0?

(7) Podcasting - Podcasting is the method of distributing multimedia files, such as audio programs or music videos, over the Internet using either the RSS or Atom syndication formats, for playback on mobile devices and personal computers. Podcasting is among the technologies which I have had the least experience with. However, my first impression is that podcasts eerily resemble the RealAudio or Media Player clips which often complement the content on webpages. Without enough experience with podcasting, though, I cannot really argue whether they are a continuation or are indeed an entirely different format. (Perhaps they fall in the middle?) What is certain is that they have a great deal of potential for change, both as a format and in how they are used.

(8) Bluetooth - The first experience I had with this technology was when a friend of mine was transferring photos she had taken on our trip to another friend sitting beside her. As they were waiting for the photos to be uploaded, I was silently wondering what bluetooth meant, and what exactly was happening in that invisible exchange of digital technology. Quite simply, Bluetooth is not a technology; it is an industry specification. It describes a wireless radio link, very much like the ones used for wireless modems and networks. However, Bluetooth is a radio standard primarily designed for low power consumption, with a short range (power class dependent: 1 meter, 10 meters, 100 meters) and with a low-cost transceiver microchip in each device.

(9) Digi-cams - I was one of the first among my associates to have access to a digital camera when I purchased a Samsung Digital Camera Cellphone (the price still stings as I think about it). But I proudly took pictures wherever I went. Nowadays, the irony is that almost everyone has a digi-cam because nearly every new cellphone comes with one, while I have reverted to using an ancient cellphone which doesn't (the reason is that my digi-phone got broken). Nonetheless, digital photography is the current preferred format of photography; and sadly, the old darkroom photo-finishing appears to be near its end. While I'll miss the wonderful days of almost fainting from the chemicals and knocking into people in the darkroom, the fact is digital photography is a new and refined method of producing better photographs. Like digital audio, it can refurbish the past by making old images clearer and longer-lasting through digital archiving. However, the fact remains that although digital photography is superior, it can never replace the nostalgic pleasure of clicking and hearing the shutter of the analog camera.

(10) BitTorrent - Peer-to-Peer (P2P) has had a long history in online technology. In fact, without it, there can be no iPod and MP3. Unfortunately for the music industry and, to a certain extent, the movie industry, much of the material transferred among P2P users is free, and any notion otherwise is frowned upon by the P2P community.

Napster was the first widely-used peer-to-peer music sharing service, and it made a major impact on how people, especially university students, used the Internet. Despite a major lawsuit forcing its shutdown in 2001, it seems as if its demise only triggered a rebellion against the establishment, as there are increasingly more P2P programs readily available for download, from all different regions of the world. The "new" Napster on the block is BitTorrent; it has produced a multitude of offshoots such as BitComet (my personal favourite). But regardless of the legalities and its aims and objectives, the question also remains whether P2P will survive the next onslaught of new technologies. I remember fondly dialing up on my 56K modem, eagerly anticipating finishing my download of a 3.5MB song in an hour via an internet site, only to move onto downloading entire disc-sets in half that time. How long until another newer, faster, and cheaper product or service comes along?

(11) Wikipedia - This entry would be ironically bare without the very source from which I obtained its information. Wikipedia is increasingly becoming a verb such as "googling," and if not for its lengthy name, it would be christened a verb even sooner. Simply put, Wikipedia is an online encyclopedia. It is open for all online visitors (with the IP address recorded and traceable) to edit its content. Wikipedia is written collaboratively by volunteers, allowing articles to be changed by anyone with access to the website. Wikipedia has redefined the way that information is published, and is part of the trend of "open access" publishing. Personally, I find the information invaluable, and I hope that it will continue to evolve along with the internet.

Thursday, June 15, 2006

Libraries on the Move

It didn't occur to me until I was reading the blueprints of the Biomedical Branch Library at UBC that a lot of libraries are being built -- or at least those that I've been associated with. First, the BMB will be on the move in late (really late) August. Langara College Library is on the move. North Vancouver Public Library's Lynn Valley main branch has just got a new "Town Centre" complex. And the library (oh, sorry, I meant to say "Information Service Centre") at the legal firm Fasken Martineau and DuMoulin is also on the move. Seems like I've got all types of libraries covered, now that I think of it -- academic, public, special, and hospital...

I'm very excited, to say the least, to be able to see first hand and perhaps play at least a small role in the BMB move. The new BMB was supposed to be built along with the Children and Women's hospital Eric Hamber Library in 1982. However, while the Hamber library was built (which still looks fabulous), the new BMB never materialized, and thus the project has been on hold ever since, until plans for it finally got underway in 1998. (But still...that's still a lot of years in waiting!)

As I was wading through the material (email exchanges, literature reviews, and blueprints) that the BMB librarian Dean Giustini left for me to analyze, I was simply amazed at the amount of information and learning that is required to undertake such a project. Here is what I learned:

(1) Library school doesn't teach you this. Actually, it does, and it doesn't. And I do regret not taking Ann Curry's LIBR 578 "Library Planning." However, my reason is that I probably won't get many chances to build a library throughout my lifetime (ok, maybe only once -- max is twice if I change jobs). But truth be told, I am certain that there can be no education that can cover what the BMB librarian has gone through during the 8 years of library planning.

(2) Meetings, Meetings, & more Meetings. From the records and notes retained, the BMB library involves a lot of talking and communication among architects, administrators, contractors, not to mention librarians. And matters can concern anything from large topics such as budget allocation to minute details such as what type of glass is used for the windows. Hence, I learned that to be effective in such a huge undertaking, communication is essential. All sides must be on the same page in order for the project to move forward. Even if this means that needs have to be flexible and accommodating.

(3) Space Changes - Reality Doesn't. One thing I noticed about the construction and preparation of library moves is that the blueprints sometimes do not fit the existing environment. For example, both the BMB and Fasken Martineau and DuMoulin are moving into smaller spaces. This means that collections will somehow need to be reduced. That ultimately means questions will revolve around moving to more electronic journals or decreasing the volume of the monographs. At FMD, the head librarian has already indicated that older volumes will be gone; and some of the lesser-used materials (such as American law materials) will need to be cut according to the areas in which its lawyers concentrate their practice.

Tuesday, June 13, 2006

New Book


Need a break from LIS stuff. At the core of what we do, the book still remains front and centre. It's what initiated our pursuits in this field, and sometimes, I feel that we get lost in the forest, and lose perspective on the importance of a good read.

Recently, I read Dean Koontz's Velocity. My fascination and admiration of Koontz has skyrocketed ever since the book. I just couldn't put it down, couldn't turn the page fast enough. Certainly, Koontz is no literary genius, but his prose is witty, entertaining, and eloquent.

I've always been a closet Koontz fan. I've read his Frankenstein series; I'm eagerly anticipating the finale of his trilogy, which is delayed, oddly enough.