« May 2009 | Main | August 2009 »

June 30, 2009

Methodology, Part I: How to Score the Obama Administration's Information Policy

After hearing various piecemeal accounts of how the Obama Administration has been treating government information, it occurred to me that a score card could be a rather useful thing.

This is a big no-no in legal and academic writing, but I'm going to start with the caveats:

1) Truth be told, I think it's a touch early to be "grading" the Obama Administration on much. It hasn't even been 6 months since he was sworn in as President of the United States. And the executive branch of the government has a lot on its collective plate: 2 formal wars, a deeply troubled (and troubling) economy, and an international campaign against terrorism is just what we started with at the beginning of the administration and things haven't exactly slowed down since January '09.

2) How does one define federal information policy? It encompass a lot or a little. Can it even be reasonably considered a cohesive thing? Does the Obama Administration have such a cohesive policy? Are its decisions in this area being guided by a single set of principles, or are they the result of ad hoc, by-hook-or-by-crook reasoning?

3) And in a combination of the above concerns ... should the administration's information policy, as we've seen implemented for lo this almost half-year, be a priority in light of the many crises facing the U.S. right now? The country has hemorrhaged millions of jobs, with a wave of house foreclosure and credit card defaults growing, and the war in Afghanistan is growing hotter (with Pakistan being drawn deeper into the fray against the Taliban). Does it really matter if whitehouse.gov can be indexed by crawlers or if White House visitor logs are available via FOIA requests?

Now that I've set up these strawmen, let me try lighting a match ...

Information policy may include everything from intellectual property protection and enforcement to censorship to distribution of government information. For the purpose of this post, and hopefully future ones, my discussion of federal information policy will focus on the principles and implementation of policies to release or constrain information by and about the federal government, including raw data, publications, government-funded experts and logs of government activity. This is a general sphere of government information that may be released voluntarily, be subject to statutory obligations for dissemination, or is/once was available to the public via FOIA.

I will also admit a bias here: I think government openness, transparency and accountability are important values and my intent is to focus on issues and policies that contribute or directly influence perceptions of those values. In truth, a lot of what I intend to look at is already covered by a wonderful resource and community, Free Government Information (FGI). I cannot replicate the breadth of what they do, nor the depth. But I want to take a step back, take a mosaic view of the range of policy and actions ... to see the big picture.

It is rather soon to start grading the Obama Administration (for this post. from herein, I'll use the term "Obama" to refer to the entire administration as opposed to the man himself). But it is not too soon to a) laud Obama; b) criticize Obama; and c) push Obama for more. And rather than do this willy-nilly, it would behoove those of us who care about access to information and information policy to look at this as a cohesive policy, with guiding principles and objectives. If for no other reason, even spur-of-the-moment decisions can take on the weight of precedent and what may be a single action one day may become a trend months later and codified years later.

And it's hard to see how 'accidental' policy promotes the values of openness, transparency and accountability.

So, the purpose of this scorecard: to create benchmarks for federal information policy actions that could be used to evaluate individual actions and general Administration trends, and be used as a basis of discussion and advocacy in this area across multiple stakeholders in the information community (librarians and information professionals, researchers and scientists, the general public, government agencies themselves).

Or at the very least get people talking about these issues. Pushing for more openness and transparency may be necessary, and we need to know what we are talking about if we want attention to be paid.

How will I do this? Basically, look at relevant incidents and evaluate them. Sounds simple, doesn't it? I think the details will be in another post.

June 24, 2009

Anyone want a conference registration to ALA?

My life is a little too frantic right now, and I've decided not to attend ALA 2009 in Chicago next month. Since the cancellation deadline has passed ... is there anyone out there who'd like mine? Not sure how many people see this blog, but if you are interested, please email: ms_eli@yahoo.com

Thanks!

June 16, 2009

Colin Powell and Transparency

I already did a rawblog of Gen. Powell's remarks at SLA2009. But it's taken me a bit to finally transcribe the following. The Q&A portion after Powell's speech was very short - only 3 questions. However, I got the chance to ask him the 1st question. Basically, I wanted to know what he thought about providing access to government data. And that's basically what I asked him, without providing specific details (I thought that would be too confrontational). He didn't quite understand what I was asked, so I repeated it. His answer, to paraphase:

You can't stop information flow eventually. He believes in maximum openness, maximum transparency, with reasonable limits for security, privacy, etc. But greater openness and transparency is good for society in the long run.

In government, there is a big problem with information hoarding, getting the information to where it needs to be ... there needs to be more information sharing among agencies. His own example of government wanting to keep too tight a rein on information: the military wanted to hobble GPS for civilian use - luckily, that didn't happen and GPS is all over the place (I think he said he had 3 in his car ... a Corvette ... ah, boys and their toys).

Information professionals need to push the envelope as much as we can to get information and data released and accessible to the public. Risk is inherent - pushing means getting pushed back. We need to work out how much risk each of us can manage. But the bias should always be towards sharing, towards transparency.

Rawblog: Dan Clancy from Google Book Search

SLA Public Policy Update - 6/16/09
Dan Clancy
Google Book Search

Quick overview of Google Book Search
Google's Mission
To organize the world's info and mark it universally accessible & useful

1 Consumer Product, 2 sources
Partner Program - working directly with publishers, permission granted expressly from rightholders
Library Project - working with libraries

Typical library collection
Public Domain
Less than 20%

Unclear/orphan works/out-of-print, post-1923

Definitely in copyright/in-print
Less than 10%

3 views: full (public domain), partner, snippet

Blending - adding Google Book Search results into the general search results.
Most books are getting discovered quickly, especially partner books

Google Book Settlement
Scope
The agreement settles existing US lawsuits against Google Book Search
Agreement is btw. Google and a class including all US copyright holders (authors, publishers, etc.) for books that Google has scanned or are intending to scan
Agreement only pertains to uses of these books in the US
includes intl books
does not include periodical/serials

Overview
Extensive process to notify rightholders of the agreement
Rightholders have choice as to their participation
* Opt out of the settlement
* Remove books from scanning
* Select desired access models
Google is authorized to scan, index & make non-display uses for all books
A set of Access models are defined for providing access to the content of the books
For in-print books, the access models are by default off, for out-of-pring books the access models are by default turned on.
Books Rights Registry formed as an indy org to represent rightholders and to collect and distribute income (held for 5 years)

Access Models

Preview Uses (current model)

Online Consumer Access
- Enable user to buy online access to single work
- Default pricing set by Google using algorithmic pricing model to maximize revenue for each individual work

Institutional Subscription
FTE pricing for universities and other groups to buy for their users

Public Access Terminal
One free on-site terminal for all public and university libraries in US

Additional models:
Print-on-demand, etc.

Expanding Access to Knowledge
Anyone, anywhere in the US will be able to search, preview & purchase millions of out-of-print books through Book Search
From snippet view (3 snippets only, user must get physical book) to preview & purchase (institutional subscriptions & public access service license)
Less people are hunting down the physical books, and they aren't easy to access

Additional Topics / Components of the Settlement
Books Rights Registry
- Claiming data
Research Corpus (text and book research methods)
Public Domain and Government Works
What happens if Google goes away
All the library partners have the right to get the scans of the books and find an alternative partner to make them available

Questions:
What does a hit on the Book Rights Registry mean?
The registry is a superset of metadata from various sources, but doesn't mean that the book HAS been scanned or WILL be scanned.
There is some underclustering of various editions of the same work

If a rightsholder wants to opt out, does he/she/it have to cite every title & edition?
No

How does the Google Book Rights Registry compare with what the Copyright Office is doing?
BRR is an independent rights database, but it does not supersede/supplant the registration info held by the Copyright Office.

Can international users access books via the GBS?
Google wants to provide access to international users of international books

The settlement was fostered to help develop a comprehensive rights database that will also help other providers
Google wants to foster competition and is pro open-source, open-access, open-s
The BRR is non-exclusive

Problem of orphan works bigger with photos, pamphlets, ephemera than with books, but Google still supports orphan work legislation

What does "85% access" mean?
Failsafe provision: certain services that Google has to make available - search, Institutional Sub, free terminal, Find in Library
If less than 85% of the books are not made accessible/available, then an alternative provider may step in and take

Who and what will decide what ends up available? How to deal with censorship?
Google has no intention to censor their material
Plans to make all scans, barring opt-out by rightsholders, available

Creative Commons?
Still figuring out a solution, but plans to recognize CC-licensed works for appropriate level of display/access

Government works?
Hasn't been fully displaying fed docs because potential liability over copyrighted works excerpted in a gov doc
Settlement allows Google to treat fed docs as being in the public domain

Google also working on making works available in different language and for those with "print disabilities"

What is the likelihood of libraries allowing other providers to scan works? Would those alternatives be covered by the Settlement?
There is lots of replication and there are other scanning initiatives. Google thinks this encourages more scanning initiatives and competition, but they would not be covered under the settlement

As material goes into the public domain, how will Google manage the change?
Google has already distributed the rights renewals registry for works between 1923-1960
Once material goes into the public domain, it's not part of the BRR ...
Still looking for a scalable method of finding more public-domain books

Google Book Search will become a monopoly because no one will be able to catch up (question from audience)
Google is investing in this and they believe that others should
They believe that this project is covered by fair use, and the settlement does not erode Google's position on fair use
There are other players in this,

What of privacy?
Privacy wasn't discussed in the Settlement because it didn't seem like the right conversation to hold with publishers and authors
The right way to do privacy is an agreement between Google and users
Libraries aren't rolling over the privacy issue, even with the current agreements
Finding a balance btw. privacy, security and user features
Google is still designing the system
Not planning to have different privacy policies with different organizations - engineering nightmare

What will happen with user data?
No individual authentication for the Institutional Subscription

Library trust towards Google - are we partners or parts of the PR campaign?
Google is used to having their products embraced by so many users so quickly
Will be working on developing relationships with partners and user institution

Details about the public access terminal?
It is a partial solution for access in public libraries.
1 terminal per building, minimum, per Settlement
The BRR may ultimately provide more
Will not be a dedicated terminal (i.e. only for GBS), but will be authenticated as the free access terminal

Pricing - what will keep the Institutional Subscription affordable?
Pricing needs to satisfy 2 objectives: 1) get revenue for rightsholder and 2) ensure broad distribution
Long tail product -- getting students to see and like the product, eventually become consumer purchasers
Google sees the brand being part of the bottom line, doesn't want to lose customer loyalty

Clancy: one of the things that's most exciting about this is the ability to expand the collections of libraries so that someone in Brownsville, TX can access some of the same material as a student at Harvard, Michigan, Stanford, etc. Hopes people will be encouraged to go to the primary source.

What is Internet Archive doing? Does the settlement affect what it's doing?
Most of what IA has scanned is public domain but has started scanning post-1923 orphan works. Google doesn't know what an ophan work is and doesn't want to create new ones.

What happens in 10-15 years if Google isn't doing as well?
The largest costs are the upfront/scanning costs, so the question in 10-15 years is whether Google will continue to provide access.

What happens with new works?
Will probably work with publishers via the partner program

June 14, 2009

Rawblog: Colin Powell

Colin Powell - SLA Opening Session, June 14, 2009

Brought his team of info professionals from National Defense University

State Department 2001 - Powell's staff was still using Wang computers
44K computers bought
Every desk in every embassy & office got a computer
Would do inspections of whether staff were actually using the computers

It's a transactional world, information flies at the speed of light

Recommendation:
Clay Shirky - Hear Comes Everybody

Cites:
Flash mobs (see clips on YouTube)

"I'm born analog, I got me a $59 converter, and now I'm digital ..."
Grandkids won't answer email, Powell has to text/Tweet them ...

"Who still uses bookmarks? I don't use bookmarks, it's all in Google"

What does Powell miss about being Secretary of State? His plane.

Like us mortals, Gen. Powell has been dog-sniffed and wanded at the airport ...

Role of leaders is to embue followers, the people who get things done, with a sense of purpose and demonstrate passion for the mission of an organization ...

Leaders also need to know how to prune an organization

Trust is the essence of success in an organization

Crashing ...

I have a weird idea of fun. So I crashed the SLA Public Policy Advisory Council meeting. Lo and behold, it was fun, informative, etc. I'm not going to put any of the members, or Doug Newcomb, SLA's one-man band on policy policy issues, on the spot, so I just listened rather than taking notes.

But I am SO pleased at the increasing role and visibility SLA is taking on issues of public policy where libraries and access to information are a concern. Its mission is not as wide-ranging as ALA's advocacy, but it has a solid yet flexible platform, knowledgeable and engaged members on the advisory board, and a good dissemination point for its message via the Public Policy Connections blog.

If you're attending SLA2009, I know how jam-packed one's personal schedule can be, but if you can, PLEASE make time to come to the Tuesday Public Policy Update session, from 11:30 am to 1 pm @ Room 143A. Special guest Dan Clancy from Google Book Search will be speaking about the Google Book Settlement. Even though I work mere yards from Google (*grin*), this is unexpected access, and if you care about the issues surrounding the settlement, come give your imput!

June 12, 2009

In DC

Attending SLA 2009. Overnight flight, taking it easy today, conf. starts for me tomorrow with a tour of NPR.