Allan's Library: May 2008

Saturday, May 31, 2008

Introducing WebAppeal

There are some good Web 2.0 applications and websites. Then there is WebAppeal. The web service is based on the principle of 'Software as a Service' (SaaS), which is rapidly gaining popularity. The rise of innovative online applications makes traditional, expensive software unnecessary. Examples of successful web applications are video service YouTube and free music service Last.fm. To bring some structure and insight into these ever-growing technologies, http://www.appappeal.com/ informs consumers as comprehensively as possible about all the possibilities SaaS web applications have to offer.

Although we're in the age of Web 2.0, one of the main challenges remains information overload. Too much information does not necessarily mean knowledge. That's why I find AppAppeal to be a convincing website which provides insightful reviews of applications and indexes them according to utility. On this website, all applications are organized in categories such as "Blogging", "Personal Finance" and "Wiki Hosting". The website is still being developed. Soon, tools will be added to create an interactive community around web-based applications.

There are already Web 2.0 review sites such as Mashable, All Things Web 2.0, or Bob Stumpel's Everything 2.0. But WebAppeal goes one step further. It analyzes the advantages and disadvantages of particular applications and provides demo videos. I really like this website. It's a good complement to a project that Rex Turgano and I are collaborating on: Library Development Camp, which not only reviews Web 2.0 applications, but offers trial accounts for users to try out different applications. Together we pack a great punch. Stay tuned. More to come . . .

Thursday, May 29, 2008

Day 4 of TEI/XML Bootcamp

Day 4 has come and gone. What did I learn? XML is not easy. Programming is an even tougher business, not for the faint of heart or mind. The main challenge I had, and the one that made my head spin, was learning the complexities behind XHTML and XSLT. XHTML is a powerful tool for the construction of the Semantic Web. Most people are acquainted with the "meta" tags, which can be used to embed metadata about the document as a whole. Yet there are more powerful, granular techniques available too. Although largely unused by web authors, XHTML and XSLT offer numerous facilities for introducing semantic hints into markup to allow machines to infer more about the web page content than just the text. These tools include the "class" attribute, used most often with CSS stylesheets. A strict application of these can allow data to be extracted by a machine from a document intended for human consumption.
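To make the "class" attribute idea concrete, here is a minimal sketch in Python of a machine pulling data out of a page written for people. The XHTML fragment and the class names ("person-name", "job-title") are invented for illustration, and the standard-library parser stands in for whatever tool one would really use; it is not the XSLT approach covered in the course.

```python
import xml.etree.ElementTree as ET

# Hypothetical XHTML fragment; the class names are invented for this example.
XHTML = """<html xmlns="http://www.w3.org/1999/xhtml">
  <head><title>Staff page</title></head>
  <body>
    <p>Contact <span class="person-name">Jane Doe</span>,
       <span class="job-title">Cataloguing Librarian</span>.</p>
  </body>
</html>"""


def extract_by_class(document):
    """Collect the text of every element that has a class attribute, keyed by class name."""
    root = ET.fromstring(document)
    found = {}
    for element in root.iter():
        # A class attribute may hold several space-separated names.
        for name in element.get("class", "").split():
            text = "".join(element.itertext()).strip()
            found.setdefault(name, []).append(text)
    return found


data = extract_by_class(XHTML)
print(data)
```

The page still reads normally in a browser, yet a program that knows the agreed class names can recover the name and job title as separate pieces of data. That agreement on names is the "strict application" part.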

Although there have been several proposals for embedding RDF inside HTML pages, the technique of using XSLT transformations has a much broader appeal, because not everyone is keen to learn RDF, which thus presents a barrier to the creation of semantically rich web pages. Using XSLT provides a way for web developers to add semantic information with minimal extra effort. Dan Connolly of the W3C has conducted quite a number of experiments in this area, including HyperRDF, which extracts RDF statements from suitably marked-up XHTML pages. What can librarians do?

Resource Description and Access (RDA) is just around the corner. And there is much buzz (good and bad) that it's going to change the way librarians and catalogers think about information science and librarianship. I encourage information professionals to be aware of the changes to come. Although most are not going to be involved directly with the Semantic Web, they can keep abreast of developments, particularly exciting developments in information organization and classification. Workshops and presentations about RDA are out in droves. Pay attention. Stay tuned. There could be relevance in these new developments that spills into the SemWeb.

Tuesday, May 27, 2008

The Digital Humanities

I am on Day 2 of the Digital Humanities Summer Institute. Prior to this workshop, I had no inkling of what digital humanities was. Not anymore. The Digital Humanities, also known as Humanities Computing, is a field of study, research, teaching, and invention concerned with the intersection of computing and the disciplines of the humanities. It is methodological by nature and interdisciplinary in scope. It involves investigation, analysis, synthesis and presentation of knowledge using computational media. The institute provides an ideal environment to discuss, to learn about, and to advance skills in new computing technologies influencing the work of those in the Arts, Humanities and Library communities.

I'm currently taking Text Encoding Fundamentals and their Application at the University of Victoria from May 26–30, 2008, taught by Julia Flanders and Syd Bauman, experts in using the Text Encoding Initiative (TEI) guidelines, a collectively developed and maintained XML-based standard for the representation of texts in digital form, which specifies encoding methods for machine-readable texts. And it has been a blast. This is the seventh year of its existence, and already it has gained the attention of academics and librarians across the world.

The DHSI takes place across a week of intensive coursework, seminar participation, and lectures. It brings together faculty, staff, and graduate student theorists, experimentalists, technologists, and administrators from different areas of the Arts, Humanities, Library and Archives communities and beyond to share ideas and methods, and to develop expertise in applying advanced technologies to activities that impact teaching, research, dissemination and preservation. What have I learned so far? Lots. But most of all, just how large a role XML plays in the Semantic Web. But more on that in the next posting . . . stay tuned.

Friday, May 23, 2008

One Million Dollar Semantics Challenge and API

The SemanticHacker $1 Million Innovators’ Challenge and new open API for Semantic Discovery have recently been launched by TextWise, LLC. The Challenge enables developers to showcase the power of TextWise’s patented Semantic Signature® technology and accelerate the development of breakthrough applications.

The Challenge provides incentives to encourage creation of software prototypes and/or business plans that demonstrate commercial viability in specific industries. Are you up to the Challenge? Go to Semantichacker.com to experience the technology first-hand in our demo and learn more about how to enter the $1 million challenge.

But what are Semantic Signatures®? They identify concepts and assign them weights; in other words, they're the ‘DNA’ of documents, which in essence become highly effective at describing what the documents are ‘about.’ Semantic Signatures® enable Web publishers and application developers to automatically embed consistent, semantically meaningful tags within their content for use in classification, organization, navigation and search.
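For a rough feel of what "identify concepts and assign them weights" could look like, here is a toy sketch in Python. It is emphatically not TextWise's patented method, which is not described here: it simply treats frequent words as "concepts" and uses their share of the counted words as the weight. The stop-word list, the function name toy_signature and the sample sentence are all invented for illustration.

```python
import re
from collections import Counter

# Tiny stop-word list, chosen only for this illustration.
STOP_WORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "for", "are"}


def toy_signature(text, top_n=3):
    """Return the top_n most frequent terms, each paired with its share of all counted terms."""
    words = [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOP_WORDS]
    counts = Counter(words)
    total = sum(counts.values())
    return [(term, count / total) for term, count in counts.most_common(top_n)]


doc = "Libraries organize metadata. Metadata describes books, and libraries lend books to readers."
signature = toy_signature(doc)
for term, weight in signature:
    print(f"{term}: {weight:.2f}")
```

Even a crude weighted list like this hints at why the idea is useful for tagging: the heaviest terms are a compact description of what the text is about, and a librarian would recognize them as candidate subject headings.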

In many ways, that's what librarians can offer in terms of information structuring and organization. Interestingly, TextWise technology will have a spot at the Semantic Technology Conference in San Jose on May 21, 2008. I won't be able to attend. But if you are, could you give a write-up? I would be forever in your debt.

Thursday, May 22, 2008

Dublin Core is Dead, Long Live MODS

Jeff Beall wrote an article called Dublin Core: An Obituary. In it Beall asserts that the Dublin Core Metadata Initiative is a failed experiment. Instead, MODS is the way to go. And this was back in 2004! What is MODS? The Library of Congress' Network Development and MARC Standards Office, with interested experts, is developing a schema for a bibliographic element set that may be used for a variety of purposes, and particularly for library applications. As an XML schema it is intended to be able to carry selected data from existing MARC 21 records as well as to enable the creation of original resource description records.

It includes a subset of MARC fields and uses language-based tags rather than numeric ones, in some cases regrouping elements from the MARC 21 bibliographic format. This schema is currently in draft status and is being referred to as the "Metadata Object Description Schema (MODS)". MODS is expressed using the XML schema language of the World Wide Web Consortium. The standard is maintained by the Network Development and MARC Standards Office of the Library of Congress with input from users.
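To show what "language-based tags rather than numeric ones" means in practice, here is a small Python sketch that turns two MARC-style fields into a MODS fragment. The record is a deliberately simplified stand-in (a dictionary of tag to subfield $a value, with made-up values), and the function name marc_to_mods is my own; only the MODS namespace and the titleInfo/title and name/namePart elements come from the schema itself.

```python
import xml.etree.ElementTree as ET

MODS_NS = "http://www.loc.gov/mods/v3"
ET.register_namespace("mods", MODS_NS)

# Simplified stand-in for a MARC 21 record: numeric tag -> subfield $a value.
# The values are invented for this example.
marc_record = {"245": "An Example Title", "100": "Doe, Jane"}


def marc_to_mods(record):
    """Map MARC 245 (title) and 100 (personal name) to their word-based MODS elements."""
    mods = ET.Element(f"{{{MODS_NS}}}mods")
    if "245" in record:
        title_info = ET.SubElement(mods, f"{{{MODS_NS}}}titleInfo")
        ET.SubElement(title_info, f"{{{MODS_NS}}}title").text = record["245"]
    if "100" in record:
        name = ET.SubElement(mods, f"{{{MODS_NS}}}name", {"type": "personal"})
        ET.SubElement(name, f"{{{MODS_NS}}}namePart").text = record["100"]
    return mods


mods_xml = ET.tostring(marc_to_mods(marc_record), encoding="unicode")
print(mods_xml)
```

A real conversion handles indicators, multiple subfields and many more fields, but the contrast is already visible: "245" becomes "titleInfo" and "title", which a non-cataloguer can read without a lookup table.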

Here's what MODS can do that the Dublin Core can't:

1. The element set is richer than Dublin Core
2. The element set is more compatible with library data than ONIX
3. The schema is more end user oriented than the full MARCXML schema
4. The element set is simpler than the full MARC format

In my article at the Semantic Report, I argue that the DCMI is potentially relevant to the SemWeb because implementations of Dublin Core use not only XML, but are based on the Resource Description Framework (RDF) standard. The Dublin Core is an all-encompassing project maintained by an international, cross-disciplinary group of professionals from librarianship, computer science, text encoding, the museum community, and other related fields of scholarship and practice. As part of its Metadata Element Set, the Dublin Core implements metadata tags such as title, creator, subject, access rights, and bibliographic citation, using the resource description framework and RDF Schema.
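As a small illustration of Dublin Core expressed in RDF, the Python sketch below writes a handful of Dublin Core elements as RDF/XML. The resource URI and the field values are hypothetical, and the helper name dublin_core_rdf is my own; the two namespaces are the standard RDF and Dublin Core element set ones. It covers only simple text-valued elements such as title, creator and subject.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("rdf", RDF_NS)
ET.register_namespace("dc", DC_NS)


def dublin_core_rdf(resource_uri, fields):
    """Describe one resource in RDF/XML using simple Dublin Core elements."""
    rdf = ET.Element(f"{{{RDF_NS}}}RDF")
    description = ET.SubElement(
        rdf, f"{{{RDF_NS}}}Description", {f"{{{RDF_NS}}}about": resource_uri}
    )
    for element_name, value in fields.items():
        ET.SubElement(description, f"{{{DC_NS}}}{element_name}").text = value
    return ET.tostring(rdf, encoding="unicode")


# Hypothetical resource and values, for illustration only.
record_xml = dublin_core_rdf(
    "http://example.org/articles/1",
    {"title": "An Example Article", "creator": "Doe, Jane", "subject": "Metadata"},
)
print(record_xml)
```

Each child element amounts to one statement about the resource (this article has this title, this creator, this subject), which is the shape of data the SemWeb is meant to consume.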

So will the Dublin Core’s role in knowledge management activity representation be significant in the emergence of the SemWeb? So far, MODS hasn't done the job, even though it has been claimed that it can. Is this problem similar to the situation during the ancient Chinese period of the Hundred Schools of Thought? Who will win in the end? Or which ones? Perhaps the opportunities and possibilities are much greater than narrowly looking for one path to absolute knowledge. So we march on . . .

Tuesday, May 20, 2008

Post-modern business in the Free World - Open Access & Librarians

I came across this interesting article from the Vancouver Sun, Post-modern business model: It's free. Videogame company Nexon has been giving away its online games for free, and making its revenue from selling digital items that gamers use for their characters. Garden says his business is as much about psychology as it is about game design. It’s no good to sell a bunch of cool designer threads to a character who is isolated in a game, because no one will see how good he looks.

Free games can have a dozen different revenue models, from Nexon’s microtransactions to advertising, product placement within a game, power and level upgrades, or downloadable songs. However, on the question of videogames (or any other digital product) being offered to consumers for free, many of Nexon's principles are based on Chris Anderson's "free" concept.

“No one says you can’t make money from free.” What does this mean for libraries? Especially since many of the mandates and goals of libraries are not to make money? The possibilities are there. A great number of libraries are already dipping into open access initiatives, particularly at a time when database vendors and publishers are charging arms, legs, and first-borns. With Web 2.0 technologies forming an important foundation for digital and virtual outreach opportunities, and the SemWeb on the horizon, I encourage librarians and information professionals to put on their thinking caps and think together in a collaborative environment to break down the silos of information gathering, and move towards information sharing.

Sunday, May 18, 2008

Librarian 2.0

Sometimes you just read an article, and go, I get it. A lightbulb shines brightly above you. Then you quickly turn it off to save energy, and run to the computer to blog about it. Professionalizing knowledge sharing and communications is worthy of praise.

There are a lot of articles that deal with the Library 2.0 mantra. But John Cullen goes beyond that, and proposes the idea that Library 2.0 should extend to the librarian. It should be Librarian 2.0. And what does that mean?

The key is developing communicative orientation: one that turns the old, tiring stereotype of library work being quiet, reflective and procedural, to one that is primarily focused on listening, engaging and developing understanding of the unique position of every individual.

In other words, just as much as technology is important to the library, we must also be alert to the changing nature of information and the profession. No longer are librarians doing the same duties repetitively and mindlessly. Web 2.0 technologies are merely the surface manifestation of L2. The opportunity is there for us to use this paradigm shift to teach other professions how to actively engage with their service consumers. All aboard!

Friday, May 16, 2008

Search Monkey and the SemWeb

We're getting closer. Yahoo is incubating a project code-named "Search Monkey," a set of open-source tools that allow users and publishers to annotate and enhance search results associated with specific web sites. Using SearchMonkey, developers and site owners can use structured data to make Yahoo! Search results more useful and visually appealing, and drive more relevant traffic to their sites.

The new enhancements differ from Yahoo's "Shortcuts" that sometimes appear at the top of search result pages. Shortcuts are served by Yahoo whenever the search engine is confident that the shortcut links are more relevant than the other web search results on the page. Often, shortcuts highlight content from Yahoo's own network of sites.

The new enhancements can be applied to any web site. Publishers can add additional information that will be displayed with the web search result. For example, retailers can include product information, restaurants can include links to menus and reviews, local merchants can display operating hours, address, and phone information, and so on—far more information than a title, URL, and description that make up current generation search results.

Here's the exciting thing. As Search Engine Land reports:

Anyone can create an app for a web site. Yahoo is collecting the most useful apps into a gallery that you as a searcher can enable for your own Yahoo search results. For example, if you like the app that was created for LinkedIn, which shows a mini-profile of a person, you can include that app so that the mini-profiles display whenever you search on a person's name.

It's true. The SearchMonkey developer tool helps you find and construct data services that you can use to build apps. Once you've built your app, you can use it yourself and share it with others. Take a look at this :)

Wednesday, May 14, 2008

From Dublin Core to the Semantic Web

I've just published a piece in the Semantic Report titled The Semantics of the Dublin Core – Metadata for Knowledge Management. It's an experimental piece about the potential for applying principles from the Dublin Core Metadata Initiative to the SemWeb. In a previous article about half a year ago, Dean and I had proposed that the library catalogue could be used as a blueprint for the Semantic Web. Perhaps theoretical and conceptual, the arguments fleshed out the ideas, but not the practical applications. In this latest article, I wanted to outline in greater detail how exactly developments in library and information science are playing out, not only in the SemWeb, but for knowledge management in general.

Can the DCMI provide the infrastructure for the SemWeb? It could. Or it could not. Some have gone as far as saying that the Dublin Core is dead. But I'm not going to add more to that discourse. What I wanted to do was find apparently disparate entities: B2B, the Dublin Core, and the SemWeb, and tie them together using principles of knowledge organization in the form of the DCMI. Blasphemous? Perhaps.

My point in the article isn't to create something out of nothing. The purpose is to extend the idea that knowledge management for librarians and information science is nothing new. In 2002, two years before Tim O'Reilly's coining of the term "Web 2.0," librarian Katherine Adams had already argued that librarians will be an essential piece of the SemWeb equation. In her seminal piece, The Semantic Web: Differentiating between Taxonomies and Ontologies, Adams argues that ontologies and taxonomies are synonymous - computer scientists refer to hierarchies of structured vocabularies as "ontology" while librarians call them "taxonomy." What the Dublin Core offers is an opportunity to bridge together different topics and extend across disciplines to navigate the complexities of the SemWeb. Fodder for discussion. But good fodder nonetheless, I hope.

Monday, May 05, 2008

Library Development Camp

I'm excited to announce the formation of Library Development Camp. Our initiative is to help fellow librarians and information professionals in Canada to explore and learn about the latest web tools and technologies from colleagues who actually use them. This web community is open to anyone working in the library or information management field in Canada.

How does this work? Most of the magic happens "offline" as we try to meet up in person to discuss these tools as well as give demos and training, hold discussions and debates, and share ideas and tips on how to effectively use these tools in a workplace or even on a personal level. It's all about sharing. We hope to spawn other LibraryDevCamp groups across Canada. If you would like to start one up in your city, let us know and we'll set up a section on our web site.

Any library/information professional who already uses any of these web tools/services is welcome to join and be a LibraryDevCamp.ca contributor or moderator. So far, we have an all-star cast of experts, such as Dean Giustini, Eugene Barsky, and Rex Turgano. We hope to have you join us, too. In the spirit of Web 2.0, our virtual meeting place is hosted by Movable Type, a weblog publishing system developed by the company Six Apart. Please stay tuned as we expect our community to grow, not only in members but also in exciting ventures.

Thursday, May 01, 2008

Economics 2.0

Although I enjoyed Economics 100 (Micro and Macroeconomics) and learned a great deal, I have to admit it wasn't the most exciting of courses at the time. The textbook we used was Gregory Mankiw's Principles of Economics. (I still have copies of the textbooks.) He has written two popular college-level textbooks: one in intermediate macroeconomics and the more famous Principles of Economics, which is popular among high-school Advanced Placement Economics teachers. More than one million copies of the books have been sold in seventeen languages.

Mankiw was also an important person in American politics, as he was appointed by President George W. Bush as Chairman of the Council of Economic Advisers in 2003. He has since resumed teaching at Harvard, taking over the introductory economics course Social Analysis 10 (which he affectionately refers to as "Ec. 10"). However, Mankiw also believes in using Web 2.0.

This is Mankiw's purpose for the blog:

I am a professor of economics at Harvard University, where I teach introductory economics (ec 10) among other courses. I use this blog to keep in touch with my current and former students. Teachers and students at other schools, as well as others interested in economic issues, are welcome to use this resource.

What's exciting about Mankiw's blog is the fact that it dips into the Web 2.0 blogosphere. The blog is much more than just a website. It's an intellectual and virtual space for him to keep in touch with colleagues and students, and to market his profession and work to the non-expert. It's fantastic outreach. Librarians everywhere should take notice.
