Spamalot?

April 23rd, 2009

I’m curious how other people are handling the tremendous amount of spam that’s coming across the “comment” section of this blog.   I would say I get at least 4 or 5 comments a day that are links out to garbage.  These are easy….   Then, I get a comment every now and then that says something like “I really enjoy reading your site.  Keep posting.  It is very interesting.”  At first, I was really happy to get good feedback, but it struck me after a couple of weeks that this sort of generic positive comment might be some sort of spam as well.  I’ve  added some of those messages without any seeming negative effect; however, I’m wondering if those generic positive comments, when posted, breed more spam?

Am I a sucker?  Have I been taken in by flattery?

or

Am I a jaded blogger, mistrustful of genuine positive feedback?  (If you are reading this and I’m talking about one of your comments, please ignore this post and keep the love-train rolling).

Are other people wondering about this too?  Is this happening to other people?

Doodle in Moodle!

April 16th, 2009

I am happy to report that my upload to Moodle proved much easier than my attmepts to FTP up to the Alabama server.  (As it turns out, I had a bit of a space issue with older files gumming up my space on the Alabama server.)  Moodle worked incredibly well and it seems like a terrific platform for running a college-level class.

It was a bit hard to find out how to create sections for things like RSS feeds (under “Blocks” menu), but once you get the feel for how it works, I can see it being a very, very powerful tool.  I’m going to look for a some side-by-side comparisons of Moodle and Blackboard.  It would be nice to get an idea of how they stack up against each other.

Also of note, I started telling my wife about Moodle (she teaches 9th grade English) and she said, “Oh yeah, we (her highschool) looked at Moodle and it stunk!”  “Why?” I asked her.  She said that teachers could not simply copy over a whole class-worth of content in one big chunk.  In highschool, she will sometimes teach 3 sections of the same class - sometimes at very different speeds and ability-levels.  It seems her highschool was looking for a way to create a class page once, copy it for the different sections, and then tweak it.  Apparently, Moodle does not allow for copying of whole classes.  The teacher would have to re-upload every piece of content into the different sections.  My guess is that Blackboard is the same, and that the “fault” comes from these products being designed with colleges in mind rather than highschools.  The right tool for the right job, I suppose…..(although, Moodle might have changed since my wife was working with it a few years ago - let me know if any of you have info on this).

So here, at the end of the semester, we are presented with a powerful learning management tool (Moodle) that takes full advantage of the flexibility offered by metadata (SCORM).  Although I, admittedly, don’t understand the fineries of SCORM, I can see how metadata sharing and portability allow for much more than simply information search-and-retrieval.  In a LOM environment, metadata sharing allows for the dynamic exchange, duplication, and  building of content.

I suppose I’ve read a lot of sentences like the one that ends the paragraph above, but I really couldn’t get an idea of just how powerful metadata (especially learning object metadata) could be without actually getting my hands dirty and doing it myself.  Glad to get the chance!

Update to Learning Guide Project

April 2nd, 2009

So, I was finally able to FTP up my learning object tutorials.  Yea!  I realize that I should have heeded Nitin’s warning about the small size allotment of our server space.  As it turns out, all I needed to do was delete a previous project I had put up there for Dr. Bonnici.  I’m including only one of the tutorials in this blog because one is very tinny (my computer mic stinks - see the post below) and the other is not syncing properly (again, see the post below).

Here is the best of my three, which, consequently, is the one without audio.

http://bama.ua.edu/~dldesanto/Subject Guide Tutorial_demo/Subject Guide Tutorial_demo.htm

I can’t wait to poke through other people’s projects!

Using Adobe Captivate software

April 1st, 2009

Well, I have certainly learned a lot by using Adobe’s Captivate software and going through the process of posting a tutorial as a web page.  As I write this, I’m not out of the woods yet;  my SSH Secure Shell is refusing to FTP up my files to the Alabama server.  But, I thought I’d blog about what I’ve learned so far.

Learning Object 1:  “How to Order Tickets from the Boston Red Sox”:

I thought for my first tutorial I would choose something that interested me.  “Why not the Boston Red Sox?”  I asked myself.  So, my first tutorial is  “How to order tickets from the Red Sox website.”  In hindsight, this might not have been the best choice.  The text on the site is very jammed together and the mouse is hard to follow through the text and rotating photographs.

I also used the microphone on my computer for this tutorial.  The sound is VERY static-y and a is poor, poor quality.  Upon poking around a bit, I found that I could also use Captivate’s  “Line 1″ option and use the mic on my headset.  This worked much better.  I used the headset for tutorial #2.

I liked all of the editing options in Captivate.  It allowed for many different ways to edit the screen capture and audio, and each editing feature allowed for a greater or lesser level of editing granularity (i.e. do you want to edit just this slide or the whole project).

Learning Object #2:  How to Use the University of Vermont’s Databases:

I thought that my other tutorials should relate to something in the field of librarianship.  My second tutorial ended up being a very “quick and dirty” run-through of the University of Vermont’s database list.  I started experimenting by adding text boxes and highlight boxes as well as the voiceover and mouse I had used in Learning Object #1.  The text and highlight boxes do a much better job of attracting the viewer’s attention, it seemed to me, so I decided to rely on them solely and cut out the voice over component in Tutorial #3.

I also found that in both “Preview” and “Publish” mode, the slides would hang-up or “stutter” and the audio would get out of sync.  I have no idea why this is happening.  When I went back to edit, each slide was fine - completely synced.  It is the “knitting together” of slides when previewing/publishing that seems to be the problem.  This actually makes me very wary of Captivate.  I’ve heard from others that Camtasia is a superior product, and this experience makes me think that that might be true.  I’ve used an open source product called “WINK” before that had less in the way of features but never once got hung-up like this.  Syncing slides is perhaps the most important part of this kind of software, and if this is Captivate’s Achilles’ heel, that is problematic.

Learning Object #3:  How to Find University of Vermont Library Subject Guides:

As stated above, I tried creating this tutorial sans voice-over.  All instruction and explication is done in text boxes.   I actually liked this method a lot.  I didn’t have to worry about the audio portion, it’s more easily ADA compliant, and it allows the viewer to focus on only the visual aspect.  Text-only instruction seems to make a lot of sense if the learning object is pretty straighforward.  Certainly, if lengthy explanations are needed, voice overs would be necessary, but this final learning object enforced to me the “less-is-more” mantra.

All in all,  I really wish I could publish these things!  I’m going to try to work on my SSH problem and post these three objects.  I really enjoy working on L.O. projects, and the creation of each Learning Object becomes its own learning experience (totally aside from the content “to-be-learned”).

Semantic web: kudos and concerns

March 11th, 2009

rosie

Adding semantic value to web content seems like it will bring about a whole boatload of both good and bad. From the readings, it seems like the sematic web will create systems that are far, far more interoperable.  I’m sitting in an airport as I write this, so I’ll use my situation as an example.  As I understand it, since my flight is delayed, a semantic web could take my flight information from Continental Airlines’ website and, using my calendar, update my schedule for that day.  It could also keep tabs on bus and shuttle options for my landing time (which is getting further and further away) and prompt me with the best option for transportation when I land. It could talk to the hotel and update my expected check-in time. The goal is, as Jason Ohler states it in his article “The Semantic Web in Education,” data integration. This kind of data integration (in my case travel data integration) would happen because computers could recognize my name as “name,” flight number as “flight number,” and destination city as “destination.”

In the example above, a semantic web would require an integration system to glean information from: airline websites, public transport websites, private shuttle websites, calendar apps, and hotel software. It’s all possible, I suppose, and I’d sure like to invest in the company that comes up with the software that actually integrates all of that imbedded semantic content.

So, wow! That’s all great. It looks like the semantic web is working out something akin to Rosie from the Jetsons: a computer that does all of the finding, sorting, and planning for you. You say “figure it out” and the computer figures it out. And it’s all possible because content creators are pre-semantic-ifying their content. This sounds great. I’m not very happy sitting in my naugahide, seat waiting for my flight, and I bet I’ll be even less-happy at 11:00pm tonight when I roll into the Seattle/Tacoma Airport and need to figure out how to get to my hotel.

Now, for the flip side (you knew there had to be one, didn’t you). As with the concern that I posted earlier in my “Pay no attention to the process behind the curtain” post, I again have to ask what happens when control is taken away from the user? As systems become more interconnected, they become more interdependent and, seemingly, more brittle. A screw-up by one of the content providers listed above (airport website, public transit website, etc.) could seemingly throw off everything else, causing a chain reaction and possibly a really, really, really frustrating trip. Figuring things out at 11:00pm in a city I’ve never been to stinks; being stranded at the Seattle Airport because something in my semantic data collection went awry would be even worse.

To use another example, I drive a stripped-down ’96 Honda Civic. It doesn’t have power windows or locks. I like it that way. When something in the door breaks, as inevitably it will, I can fix that one part simply and cheaply. If I had power locks or windows and something broke, I would need to replace the whole interconnected mechanism (which is expensive). Thinking of this in terms of the semantic web, when one piece of information is faulty, the whole interconnected system of data is compromised. As a culture, I think it would be wise to think long and hard before turning over our data collecting (what our brains are programmed to do, incidentally) to automation.

Privacy, I would think, would also be a concern for semantic web developers. If our names start being marked as “name” and then attached to a host of other data about us, this makes it vastly easier to, well….collect that data. I would certainly want to know how my privacy is being safeguarded by a web that makes it so very easy to collect data.

My final concern is marketing-based. When I log on to Amazon, it’s quite clear that they are using features of a semantic web already (some of our readings also single out Amazon’s usage of semantic web technology). They have tracked my shopping habits and make suggestions about what products I ought to buy. They have also aggregated shopping data to find “most popular items.” By identifying my preferences, favorites, and habits – these companies can tailor their advertising specifically to my tastes. As soon as I changed my Facebook relationship-status to “engaged” the advertisements on my Facebook pages changed to wedding-related ads. These companies have figured out that they can sell to me more efficiently. This isn’t such a big deal with Amazon and Facebook, since they are only a few companies out of the hundreds I encounter every day, but imagine if every company did this? Now imagine that they could put this advertising in our Kindles, IPods, PDA’s etc. Just think of what efficient consumers we could become!

In any case, I can’t get M.T. Anderson’s book Feed out of my head (no joke intended there). If you’ve read it, you’ll know why. But, with all of the cautions, I also love using Pandora (which seems to assign semantic value to music) and I actually like Amazon’s suggestions, so I certainly am amenable to the development of a semantic Web 3.0. I just think that a semantic web raises a lot of questions. Progress ought to be careful and deliberate. Considerations of privacy and public space ought to be considered.

Keeping a human as the active agent in information gathering and assessment also seems important. I realize that this goes against the “integration” value of some semantic web projects, but automation can lead to an information house-of-cards if something goes wrong. Perhaps that final thought is general and obvious, but as I write this as my laptop, with all of this memory, programs, applications, and computing power is dying because of a faulty power cord (a wire and four-dollar surge protector box), so I’ll end this post and hope that I get on a plane soon.

Quick thought on Microcontent

March 3rd, 2009

potato-head

“It’s amazing how quickly things change online.”  Cliche’ at this point, right?  Maybe, but every now and then I run up against an essay or blog post that’s only a year or two old whose discourse seems anachronistic in light of current Web 2.o technologies.    Lorcan Dempsey’s and Richard MacManus’ posts from 2005 and 2006 regarding the advent of microcontent now seem like one of those old early 1900s advertisements heralding the newfangled technology called the “radio.”

Since these posts were written, content has indeed become much, much more separated from its presentation.  On my Facebook page, I can pull in and display the most popular news stories from the New York Times.  I can post other RSS feeds.  I can post a YouTube video.  I use my Twitter account to update my Facebook status from my cell phone.  Sometimes, the path of what-is-linking-to-what and being-posted-where becomes dizzying.  I hate the term “mashup” so instead I’ll call it a Mr. Potato Head (don’t worry Mr. PH, no mashing here).  The web is becoming a Potato Head - interchangable, movable, dynamic - and the companies, businesses, and organizations that realize it are getting a jump.

I always think of NBC as my chief example.  A few years back, they freaked out and pulled and pulled all of their content from YouTube.  They fell back on the old-school content-rights positions of “My content.  Mine, mine, mine.”  “Ok” users said, you can have your content, then.  And along with it, you can also have all of the free publicity shows like Saturday Night Live were getting by having skits openly posted.  So, NBC re-thought their position and now offer users a good chunk of their content for free.  In fact, they encourage you to “Share” and “Tag” their videos and probably make a little money on the side by throwing in some advertising.  Pulling apart the pieces and giving it to users to reassemble (as Dempsey and MacManus pointed out a few years ago) is quickly becoming a major method for content delivery.

So, kudos Lorcan Dempsey and Richard MacManus.  You were way, way out ahead of the curve on this one.  I wonder how many more ways we can mix and match the Potatohead?  Will the pendulum swing back toward stability and static content representation?

Survey Results!

March 1st, 2009

A few posts ago, I put up a survey to test (in a very, very rudimentary way) whether tagging was a viable means to index subject-specific images.  The example that I tried to keep in my head was that of an art historian looking at art slides.  While the lay-person might be able to identify “ofness” tags when looking at a work of art (fruit, bowl, grapes, table), it takes a specialist in the field to assess “aboutness” (vanitas style, dutch renaissance, pointillé technique).

Since our class is comprised of specialists in the field of information science, I thought that we might be able to create and assess tags for images relating to information science.  I tried to choose images that related to our field  or, better said, that have and “aboutness” only an information science specialist would “get”.

The tagging results are below.  Seventeen people viewed the survey but only four people completed it.  THANK YOU to those of you who took the time to complete the survey!!!


meebo

“Ofness” tags assigned:

meebo, IM, instant messaging, chat, Meebo instant message screen, im client, chat dialog box

“Aboutness” tags assigned:

virtual reference, virtual reference, Virtual reference, user services, distance education, reference interview


marvrecord

“Ofness” tags assigned:

harry potter, MARC record, catalog, MARC record, Harry Potter, J.K. Rowling, marc record, MARC record

“Aboutness” tags assigned:

MARC, bibliographic data, indexing, access, records, Harry Potter and the prisoner of Azkaban (title), catalog record of J.K. Rowling’s Harry Potter and the Prisoner of Azkaban


secondlifelibrarian1 “Ofness” tags assigned:

librarian, reference, Second Life reference desk, woman, purple suit, Librarian standing in front of ref desk, Second Life screen shot of female avatar

“Aboutness” tags assigned:

second life, second life, virtual libraries, Web 2.0 tools, outreach, releva?,  virtual reference desk in Second Life


lcclassification1

“Ofness” tags assigned:

classification, library of congress, LC categories, table, list of abbreviations and related items

“Aboutness” tags assigned:

LC class headings, cataloging, indexing, authority, LC classification, Library of Congress classification system


pisa1“Ofness” tags assigned:

pisa, html, leaning tower, Eifel Tower, html coding, photograph, leaning tower of pisa, leaning tower of Pisa

“Aboutness” tags assigned:

humor, categorizing images, flexibility of XML, icons, Defines Italy, I have no idea


ebsco1“Ofness” tags assigned:

ebsco, search, EBSCO search interface, Search engine web page, database search interface

“Aboutness” tags assigned:

user interface, Boolean search, searching ease, interface construction, Boolean operators, Start of a literature search at JHL, EBSCO host ASP search interface for JH Libraries


deweytea1“Ofness” tags assigned:

tea bag, tea, mug, numbers, tea cup, tea bag, call number, Tea cup, filled cup, teabag

“Aboutness” tags assigned:

Dewey Decimal, humor, ease of call number system, Start of the day!, a recently brewed mug of tea


story_time1“Ofness” tags assigned:

storytime, children, librarian, reading, children, library, story hour, librarian, People gathered in a room, children, shelved books, woman, picture book

“Aboutness” tags assigned:

children’s services, user services, outreach, Information Literacy, School librarian reading to her kids, Librarian reading Miss Spider book in a library


americanmemory1“Ofness” tags assigned:

american memory project, history, American Memory, welcome page, Thomas Jefferson, web page displayed in old IE browser, database search interface

“Aboutness” tags assigned:

digital collections, user friendliness, public access, tax dollars at work, Amazing digital photo collection, American Memory database search interface


Upon completion of the survey, I realize that there are some things that I would do differently were I to do the survey again.  I would leave off the two weaker examples at the end (the storytime image and the American Memory image) because their need for a specialist is less definite.  Also, to better assess how effective subject specialists are at assigning “aboutness,” I would survey a control group of non-specialists.  It would be interesting to compare both the “Ofness” and “Aboutness” tags from the two groups.  A survey with a control group might do a better job of highlighting the difference between specialists and non-specialists and therefore better point out the benefit of cooperative tagging done by a group of subject specialists.

However, this survey does (despite the small number of respondents) begin to point to the value of tag-based indexing in a specialized environment.  There are many good tags assigned in the “Aboutness” sections that would aid in discovery, were an information specialist searching for like images.  There were also a number of instances of agreement in tagging.  I think the results begin to show that cooperative tagging by subject specialist ought to, at the very least, be considered as a one method for indexing a repository of images.

Taxonomies and Folksonomies: Not Sworn Enemies

February 23rd, 2009

spy-vs-spy1

It seems like a good majority of literature on Folksonomies and Taxonomies divides the “issue” into two camps. These two camps then proceed to lob grenades at each other. The folksonomy camp accuses the taxonomists of creating highly subjective and arbitrary semantic relationships that further cultural bias, force users into sometimes awkward or arcane jargon, and remain rigid while information changes. The taxonomic camp accuses the folksonomists of creating “sloppy” systems that are imprecise, rife with ambiguities, and poor for returning all relevant records. In regard to the charges leveled in both directions, I have the following comments:

  • The moniker of “sloppiness,” quickly thrown at folksonomies, is grandly overstated. The research done in the article “Folksonomies: Tidying up Tags” clearly shows that some tags gain critical mass and can be identified as culturally agreed upon. One-time tags, be they junk, whitenoise, or just simply unpopular tags, get weeded out in organizational visualizations like tags clouds. They fade to the back. Core terms become recognized as such by the will of the group.
  • More tags actually improve accuracy! The bigger the herd, the more apparent commonalities will be with a larger number of tags. Also, with more contributors, there will be a greater diversity of less-popular tags. This is not actually a bad thing. With more access points, users can think about an indexed item in more ways and still reach the same item. Clay Shirkey makes a good point in saying:

Tagging gets better with scale. With a multiplicity of points of view the question isn’t “Is everyone tagging any given link ‘correctly’”, but rather “Is anyone tagging it the way I do?” As long as at least one other person tags something the way you would, you’ll find it…

Not all one-time tags are junk. Some might actually be useful for someone else looking for the same thing.

  • Tags simply cannot replicate the semantic relationships in a taxonomy. There is a world of value in BT, NT, and RT relationships. Although the value gained also introduces a healthy helping of subjective bias, these relations are quite useful and cannot be replicated with tagging alone. This is a huge reason why taxonomic structure (be it pre-set hierarchy or on-the-fly faceting) remains viable and important.
  • As alluded to above, tags provide a democratic way of indexing. There is no inherent “power” doing the indexing. The power is us and reflects our naming. That said, there is a still a cultural bias within the group (a picture of snow will most likely be tagged “snow” in U.S.-based Flick’r rather than one of the 12 Inuit words for snow); however, the open format allows for group bias to be easily rectified by users (there is nothing saying that Inuit word tags can’t be added to the photo of snow in Flick’r).

So, the thoughts above bring me the following benefits outlined for each method of indexing:

  • Semantically, a taxonomy is much richer
  • Tagging is structurally more flexible
  • Tagging creates more objective tags
  • Tagging allows for many more search-access points
  • Taxonomies can (conceivable) guarantee that all relevant records will be returned

So, why not have both?

Right now, I can think of only a handful of instances where both types of indexing are employed at the same time. Amazon does it, although the tagging side of their site isn’t featured very well and hasn’t attracted that many taggers. I believe (although I’m not sure) that the University of Pennsylvania’s PennTags system allow users, once they are logged in, to view communal tags in the catalog as well as normal LC subject headings. Perhaps the brute-force computing necessary to create a system that can do both simultaneously makes the option unviable. Also it might seem confusing to pair the two, especially if the interface is poorly created.

There is, however, a good case to be made from pairing taxonomic and folksonomic structures, especially in image and video indexing. As Christine Jacobs points out in her article “If a picture is worth a thousand words, then…” images and video are akin to the novel in that “aboutness” is incredibly harder to define and far far more arbitrary. Three different indexers might come up with three different ideas of “aboutness” for a photograph (or, in that case, for a novel….what is Moby Dick really about?). In this case, an image indexing database might be well served to try to reap the objectivity of tagging while, at the same time, implement the control of a taxonomy.

Slawsky’s article “Building a Keyword Library for Description of Visual Assets: Thesaurus Basics” describes the how a DAM system can create a controlled vocabulary that is rich in semantic relationships and captures all records within a database. Great! This could be a terrific starting point. Unfortunately, this DAM system’s semantic relationships and assigned vocabularies are arbitrary and rigid. (Many of Slawsky’s examples for “BT” relationships left me scratching my head. They certainly were not intuitive for me.)

Here is where tagging can help. If a searcher is not having luck in the controlled vocabulary, why not offer the option of searching and contributing tags? Searches could be side-by-side or integrated into the same interface. Providing a second way to search could benefit the imagined cultural outsider, the neophyte, or simply the person who is thinking about a topic differently. For a person that likes following semantic relationships, is looking to expand or narrow their research, or is familiar with the system’s terminology - then the traditional taxonomy still offers that structure.

If Amazon can pull this off for everything from novels to diapers, there has to be a way to customize and improve it for large image collections. I don’t know of any image indexing system as of right now that offers both the taxonomic and folksonomic approach, but maybe it does already exist. Does anyone know? If not, it needs to be created. It seems like the two approaches complement each other well and would benefit from being implemented in conjunction - especially in a image indexing environment.