« September 2007 | Main | November 2007 »

WHAT GETS VIEWED? An exploratory study of large IR collections

In my work circle there has been a lot of talk about growing our institutional repository. There is a big push to add meaningful content. The thing that I always get hung up on though is usage. I’m very interested in what people find useful, and my feeling is that if I’m going to pitch this service to my faculty, then I need to prove to them that the stuff is actually being seen, rather than simply offering them a theoretical argument about why open access is good and big publishers are evil.

So I decided to do a mini study. I wanted to see what the top items viewed were across several universities. I used ROAR to identify DSpace collections in the US, and then sent emails to the libraries with the 10 largest collections. One library never responded, another (MIT) shot me down with ““I'm sorry to report that our staff is unable to provide that data at this time”—but all the others provided me with a list of their top 20 most viewed items. (Thanks!)
I should note that Georgia Tech and U of Oregon were the only organizations in this sample that allowed open access to their statistics.

The results were very eclectic, as expected, however there were definite themes that emerged. For example, the U of Rochester included many musical scores, U of Michigan was heavy with engineering technical reports, Ohio State had numerous articles from The Ohio Journal of Science, U Oregon featured NewBreed Librarian articles as well as classic texts from Shakespeare, Milton and others, while Oregon State included several environmental topics. U Maryland had the most diverse materials and is unquestionably the heaviest used collection within this sample.

Someone should publish a scholarly article about this and perform a detailed synthesis on these collections, but in the mean time, here are the top viewed items from each of the collections:

  • Delivery of DNA and Recombinant Infectious Bursal Disease Virus Vaccines in Ovo (dissertation), 34,768 hits, U of Maryland
  • How Do I Do This in ArcGIS/Manifold?: Illustrating Classic GIS Tasks, 18,636 hits, Cornell
  • Relaxation studies in the muscular discriminations required for touch, agility and expression in pianoforte playing, 8,764 hits, U of Rochester
  • A study of the role of carbon in temper-embrittlement and the effect of temper-embrittlement on the fatigue properties of a 3140 steel, 7,155 hits, U of Michigan
  • Dragonflies Taken in a Week, 6,650 hits, Ohio State
  • Measurement of delignification diversity within kraft pulping (dissertation), 5,517 hits, Georgia Tech (current year only)
  • NewBreed Librarian ; Vol. 2, No. 4, 2,093 hits, U of Oregon
  • Estimating the weight of plywood, 500 hits, Oregon State

There is definitely a lot of long-tail action going on too. Most of the repositories featured one or two heavily used items, but then dropped off drastically.

Umaryland_sample_ir_long_tail_2

Some questions:

  • Why is the U of Maryland IR used so heavily? Their top 3 items blow away everyone else (34,768 hits; 32,916 hits; and 32,214 hits respectively)
  • How are people finding this stuff? Google? Native Searches? Catalog Searches? Direct Links? We need to run an analytics program.
  • How many of these hits are from web crawlers or related software?
  • Why the long-tail? What makes those top few items so popular? And just how long is the tail? Could you say something like 90% of everything in our IR was viewed at least once over the past two years?
  • If you place your IR within your metasearch tool, will it pad your results?
  • Is there a big difference between views and downloads?
  • Why does the DSpace interface still look so mid-1990’s?
  • How are items obtained? Is it piecemeal or more systematic? Are we building collections or is it random take-what-we-can-get?
  • What is the percentage of dissertations? (or, take away dissertations and what have you got left?)
  • What non-text items are collected (mp3, videos, jpg, etc)?
  • Leaving the  big vision rhetoric aside, what is the goal of each IR?
  • How do you measure the success of an IR? Is it volume or downloads or something else?

(If this is your area and you want to work on something together, let me know. I'm devoted to ALA Editions right now, but I'd like to continue this project into 2008.)

In the works...

I'm heading down to Florida to give my last talk for a long long time. It will be nice to come back home, get into the workshop, close the door, and build a few new things. Here is a sneak peak at some current projects.

New_projects_georgia_tech

Next week I'll have a two-part post on Institutional Repositories: my promotion scheme & a mini-study on use

Fairfield U Goes Experiential

Picture_12 Picture_16Picture_21Picture_18_2

I got a sneak peak at Fairfield Beach, an instructional tool that the Fairfield U library is using this semester. I have to claim some influence since I shared my “Laguna Beach in a Library” project with one of the screenwriters. The gist of the project involves the experiential narrative -- showing how the library fits into the context of the student life, as opposed to glorifying the librarian or collections. In their video, the library is the setting, or perhaps is arguably a character itself, yet the students are the stars.

The librarians at Fairfield U put an interesting spin on it by embedding a choose-your-own-adventure approach in which the protagonist is presented with options and the audience must choose her path.

Menu

They are using the video with instruction sessions this semester. Previously they handed out maps/floor plans and talked students through the various services that the library offers. The video approach on the other hand engages the class (they make the decisions on what to do next) and it attempts to provide a more interesting delivery of the material—it also follows a romantic storyline.

The script was written over the summer by two librarians: Leslie Porter and Ramona Islam. They turned it over to the Fairfield media lab where a student (Bob Cammisa) pulled together the cast and filmed the project.

Don’t you wish you could watch it? Well it's not online—and if it’s not online then it doesn’t exist, but here is a leaked demo. (UPDATE: Fairfield asked me to remove the link to the video clip.) For an Abercrombie school like Fairfield this type of presentation works well, plus the users are the stars, not the librarians -- sorry L-team. But seriously, if you make a video, put it online so that it becomes a discovery tool—this isn’t the 90’s—everything is web-based now!

If you want more information about the Fairfield Beach video tool email the reference staff.

For more library orientation videos, see: U Dayton & UVA (also see The BC, non-library content, but same genre.)

Special thanks to Dottie Hunt, who constantly shows me why I should be using a Mac.

Librarians think like trees (a photo commentary)

Librarians_are_like_trees_talking
The problem with librarians is that we think like trees instead of as a forest. Are we driving ourselves into extinction by failing to see the big picture?

Forest_destruction_libraries

See also:
Can we ever really move beyond the self-centered library?

Notes from Vint Cerf @ Georgia Tech

Vint_cert
Vint Cerf, a founding father of the internet, gave a talk at Georgia Tech. Here are a few if my notes, but they were a bit messy because I had to write in pitch dark.

  • Works for Google, they asked him what he wanted his title to be, he said “archduke” – they ended up going with Internet Evangelist.
  • Cited stats that internet has 1.1 billion users--- but that there are over 6.5 billion people on the planet—so while many people use the web it’s not as universal as believed.
  • Pushed the philosophy: IP on Everything – meaning that everything on the internet would be on a single protocol, as opposed to TV (designed just for TV), radio (designed just for radio), etc— these mediums all had special and limited purpose, yet the internet would be application independent – everything would work regardless of what it is—isolate the application. In this sense the network is insensitive – it doesn’t care what the object it—it’s just a baggage of packets—the internet doesn’t discriminate against types of information.
  • The internet is an experiment that never ended.
  • We’re going to run out of IP addresses in 2011. Trying to release IPv6 to increase range, but there are lots of problems. Essentially going from the original 32 bit system to 128 bit system. Of course, we think this should be enough today, but in 20-30 years will we wish we had more?
  • There has been a sea change with the use of the web—it’s become a broadcast medium, rather than just reading and downloading. The Consumer is the Producer.
  • A user can make a small change or contribution to Wikipedia and have a substantial impact – not possible in the past.
  • There are zero barriers on the web; nearly unlimited opportunities to inject content/comment.
  • It’s fascinating that players of virtual games, such as World of Warcraft, will pay others to “play” their character for them so that when they do log into the game they have high skills, better equipment and an overall better experience. (Why suffer through the lower levels when more advanced characters have all the fun?)
  • Movie studios are moving toward digital players in movie theaters because it’s much less expensive than reel-to-reel film, and less labor intensive. Have to produce 3000 copies of reel film for blockbusters, plus ship them, fix them, etc.
  • You can’t stop illegal downloading—movie studios need to consider alternative revenue streams-- (he’s mentioned this to them, but they don’t listen) His suggestion is for them to embed media content through product placement, for example, you download a movie and see a Mac that the character is using, click on it to link to information about the computer. You like a shirt, car, or song, pause and click, buy. He also mentioned the potential of selling ancillary materials such the e-book that a movie was based on, as well as bios, PR materials, etc, similar to DVD bonus material—build it for web with goal to link it with other stuff, rather than sitting alone on DVD.
  • In Africa, growing use of mobile phone minutes as currency. They trade minutes like money, a micro-economy is emerging.
  • Mobile computing is the future – the hub of communications.
  • There is going to be a rise in geo-location services. GPS is the next killer app. Mobile devices connected to inventory of stores. Direct access to everything. Information questions are based upon where we are. Also, you can access information based upon who you know. Questions are based on: time, location, and personal universe.
  • There is an explosion of web enable devices for mobile access, particularly in Japan. Not just devices to get online, but add new functionality.
  • Digital picture frame – you can connect it to a flickr account (or other service) and it will update to the latest image. So the grandparents can always have the latest pic of your kids.
  • RFID enabled refrigerators—it knows what’s inside and it can suggest meals based on the ingredients; it can also crawl the web for recipes.
  • Internet enabled surfboard. Predicts wave patterns, gives you real time data.
  • SMS text messaging washing machines.
  • Bathroom scale sends info to your doctor, could be connected to refrigerator to help you keep your diet.
  • There is a problem with long term preservation. Files will become unreadable. The popular notion is that any material that is valuable or important will be upgraded to the newer formats, but much of the information produced today will not be retrievable tomorrow. We need to preserve operating systems, as well as hardware and software too.
  • If software is no longer supported it should become public domain.
  • In the long term, like the year 3000, the web will be a sea of uninterruptible data. Kind of like the ancient Egyptians or Stonehenge.
  • Working now on the interplanetary internet. (IPN) space based communications. There is a long delay in communications between Earth and Mars. When the planets are the closest it takes 4 minutes to send packets, when furthest 40 minutes.  The problem is that they are always moving and resulting in timeouts, delays, and other random connection problems. LA and NYC are far apart, but they are stable, they don’t move.
  • Working with NASA and internationals to equip satellites and other space vessels with standard protocol for web communications. Things that get sent into space have a limited “research mission” and then they are done—could be used as web hubs/routers. Example: Mars Rover. The original mission was 90 days, but 3 years later still there and functional. If they could enable Rover to bounce packets to satellites, would speed up communications.

           Ipn

  • Google doesn’t want to get into censorship, therefore does provide gmail or blogger in China. They don’t want to just their government, but since they comprise 1/5 of Earth’s population have to be considered. Can’t just ignore because Communist, have to find a way to work with them. Feels the web will help spread democracy.
  • Two regrets about internet: 1.) wished he had picked longer bit count for IP address, 2.) wished there was stronger authentication between devices (digital signatures)
  • When the internet started it was just a proof of concept—they had hoped to take it down and launch a production model later, but pressure of government to make it work.
  • People think that we knew what we were doing or that we had a grand plan or vision—but we didn’t, it was just theory, just an experiment—we didn’t even think it would work, didn’t think it would connect.
  • Throughout the 1970’s and much of the 1980’s, he had absolute control of the internet. But only 400 computers were connected. Forced all users to adopt the TCP/IP protocol, or else he kicked them off.
  • The web has unleashed a sharing phenomenon.
  • Advice for students: look for breaking points—look for things that are unsustainable and then become part of the solution.
    • Fish are going extinct = huge loss of food (shortage to come)
    • Oil = what’s the future of transportation?
    • Health Care = soon to crush the US
    • (How about academic journals or even academic libraries in general? He didn’t say that, but I’m just saying)

Look for things that are going to break and then try to imagine what’s next. Look for things that are visibly unsustainable and then position yourself can fill the gap.

Sidenote: GT Students should feel ashamed for not coming out. There were less than 200 people in attendance and this guy really invented the internet, unlike you know who.

Sidenote 2: He's talking about the internet in space and I can barely get a phone signal in my bedroom?

Daylight Savings as an excellent PR opportunity

Finally the weather is tolerable here in Atlanta; Fall is my favorite season. With this change comes the end of daylight savings. This is a great chance to pitch security. Just as all these corporations are jumping on the go green bandwagon, libraries can send a message about safety.

My library is located in an urban environment. Crime happens. Why not use this opportunity to reinforce an image of the library as a “safe” place? We have a security officer who checks IDs at the door and another one roaming the building, we also have a campus shuttle service that takes students to their dorms at night until 2am. These are things we already do, but we don’t talk about them publicly. By creating a media package, we can get this word out in conjunction with the end of day light savings (aka: shorter days) and sell the library as a safe haven.

Many of us spend so much time promoting resources, equipment, policies, reference help, etc but what about peace of mind? Promoting security makes us look responsible—and, as is our case, responsive to recent concerns about crime in the area. This effort makes us look like we’re addressing actual current needs, when really we’ve been doing this all along.

It invites the potential for wide collaboration too:

  • Maybe Student Government or some other group could coordinate a “walking together” program—so that at the top of each hour (after 10pm) students could meet outside the library to walk over to dorms or distant parking lots. (This is directly in line with police recommendations that people travel in packs.)
  • Maybe there could be some type of partnership between the library and housing? Perhaps we can deliver a midterm package informing students about upcoming events, workshops, all the usual library stuff, but also include something about safety?
  • The library and the transportation office? They run the campus shuttles—we could push that info to students, encourage them to add the number to their cell phones or something to that effect. Maybe something with campus bus routes adding a few “night” time spots. We also want to find ways to get them too the library (not just home from it) so perhaps some transportation possibilities there as well? Door to door and back again.
  • Maybe invite campus police to hold some safety seminars or whatever within our presentation space? Even if students don’t attend we gain positive PR.
  • Maybe write a letter to the editor of the school paper laying out this “safety” theme stating that since we’re open late this is a concern of ours and that the library is working with all these other units on campus to ensure they have an environment that is comfortable and safe… etc.

It’s all about perception and media spin, so consider an awareness message that makes the library look good. It doesn’t always have to be about books, journals, exhibits, and computers—think big picture. What’s the patron thought process? I am focused now at trying to find out what deters them from visiting the library and then implementing solutions—how can we create a  motivational message? Or rather, the right message at the right time for the right person.

If I can gently remind users that the library is open 24 hours, packed with assistance (from librarians and IT specialists, to multimedia assistants, teaching assistants, and peers), filled with technology and resources, invites eating, offers both quiet and noisy study space, and the occasional exhibit, lecture, or other “intellectual” distraction—combined with the perception that it is not only safe, but activity seeking to ensure their security--- I feel I could raise both gate count and user satisfaction simultaneously.

Many libraries, particularly ARL’s, focus on promoting excellence in collections, but I am far more interested in promoting an excellence library experience.

About Brian

My Photo

My Online Status

AddThis Social Bookmark Button
Blog powered by TypePad
Member since 05/2006