Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
LLC
Todays agenda
9:00-9:10 9:10-9:15 9:15-9:45 9:45-10:00 10:00-10:30 10:30-11:00 11:00-12:00 12:00-12:30 12:30-13:30 13:30-14:30 14:30-14:45 14:45-15:15 15:15-16:15 16:15-16:30 16:30-17:00 10 minIntroduction 5 minWarm-up exercise 30 minTaxonomy fundamentals: Building taxonomies 15 minTaxonomy exercise 30 minTaxonomy fundamentals: Taxonomy business case 30 minTea Break 60 minTaxonomy governance 30 minCapabilities self-assessment 60 minLunch 60 minTaxonomy benchmarking 15 minBenchmarking exercise 30 minTea Break 60 minContent tagging 15 minTagging exercise 30 minQ&A
2
y Metadata and taxonomies community leadership. President, American Society for Information Science & Technology Director, Dublin Core Metadata Initiative Adviser, National Research Council Computer Science and Telecommunications Board Reviewer, National Science Foundation Division of Information and Intelligent Systems Founder, Networked Knowledge Organization Systems/Services
Taxonomy Strategies LLC The business of organized 3
What we do
Organize Stuff
the properties needed to describe content so that it can be found & used. y Vocabularies are collections of terms that are used to specify some of the metadata properties.
Some vocabularies are big
what metadata & vocabularies are required, and then represents them formally.
Taxonomy Strategies LLC The business of organized 5
Not-for-Profit
Industrial Sector
y Agriculture & Processing Food, Lumber, Pulp & Paper y Financial Services
Banking & Insurance
y Government
Public administration Public safety
y Manufacturing Consumer Products y Medical & Health Care y Mining & Refining Petrochemicals, Oil & Gas y Pharmaceuticals
workshop? y Please rank the questions from the most important (5) to the least important (1) y Please provide your job title, organization and department; your name is optional.
Priority (1-5) Questions
Your title or role: Your org or industry: Your dept: Your name: (optional)
Todays agenda
9:00-9:10 9:10-9:15 9:15-9:45 9:45-10:00 10:00-10:30 10:30-11:00 11:00-12:00 12:00-12:30 12:30-13:30 13:30-14:30 14:30-14:45 14:45-15:15 15:15-16:15 16:15-16:30 16:30-17:00 10 minIntroduction 5 minWarm-up exercise 30 minTaxonomy fundamentals: Building taxonomies 15 minTaxonomy exercise 30 minTaxonomy fundamentals: Taxonomy business case 30 minTea Break 60 minTaxonomy governance 30 minCapabilities self-assessment 60 minLunch 60 minTaxonomy benchmarking 15 minBenchmarking exercise 30 minTea Break 60 minContent tagging 15 minTagging exercise 30 minQ&A
9
The Taxonomy problem: How to pick from > 5,000 faucets? By:
y Category y Price y Brand y Color/Finish y # Handles y Series Name y Water Filter? y Faucet Spray y Handle Shape y Soap Dispenser?
Taxonomy Strategies LLC The business of organized 10
y When do the
things in the list change? y How do we maintain the list? y What rules do we follow?
11
Conduct interviews
Identify, gather & review resources Define fields & purpose Define content chunks & XML DTDs Compile controlled vocabularies Develop workflow, rules & procedures Manually tag small sample
12
Beta D&T
Interview alpha users Gather additional sources, if any Modify CMS for beta
Final D&T
Interview beta users
Define content chunks & XML DTDs Compile controlled vocabularies Develop workflow rules & procedures Manually tag small sample
Modify CMS for beta Revise, use in beta CMS Modify & extend workflows
Modify for 1.0 Revise using team procedu re Finalize procedure materials
13
effort to develop an equivalent taxonomy. y But pre-existing taxonomies rarely fit an organizations needs and may require extensive customization.
Recommendation
y Adopt a faceted approach. y Reuse existing (especially internal) vocabularies for as many
of the facets as possible. y Plan on doing full-custom Content Type and Topic taxonomies.
14
Definition
Organizational structure.
Potential Sources
SP 800-87, U.S. Government Manual, Your organizational structure, etc.
Structured list of the various types of Dublin Core Type Vocabulary, AGLS Document content being managed or used. Type, Your records management policy, etc. Broad market categories such as lines of business, life events, or industry codes. Place of operations or constituencies. Business activities or functions performed to accomplish mission and goals. Business topics relevant to your mission & goals. Subset of constituents to whom a piece of content is directed or is intended to be used by. Names of products/programs and services. SIC, NAICS, Your market segments, etc.
Industry
Location
FIPS 5-2, FIPS 55-3, ISO 3166, UN Statistics Div, US Postal Service, Your sales regions, etc. Federal Enterprise Architecture Business Reference Model, Enterprise ontology, Your business functions, etc. Federal Register Thesaurus, NAL Agricultural Thesaurus, Your research areas, etc. GEM, ERIC Thesaurus, IEEE LOM, Your psycho-graphics or personas, etc. ERP system, Your products and services, etc.
15
16
conversion to attributes. 2. Repeated lists of subcategories signal a possible need for an attribute. 3. The number of attributes should not exceed six or seven, so not all attribute candidates should be used.
Avoid selecting strongly correlated attributes, such as Weight and Shipping
Weight.
Priorities
1. Choose Categories that apply to many products, over those with few
products. 2. Choose Attributes that apply to many Categories over those that apply only to very few categories.
17
Products
Accessories Content Phones Services Batteries Cases Chargers Data Hands-Free Headsets Miscellaneous Purchased Subscription Versatile Phones Smart Devices Basic Phones Prepaid Phones International Only Phones Mobile Broadband Cards Conferencing Internet / Data Landline Phone Network & Roaming Relay Services Solutions Wireless Data
18
Brand
Canon (15) Fuji (10) Kodak (17) Nikon (8) Olympus (9)
Merchandising attributes Usage (E-mail, Internet Browsing, Programming, ) Segment (Home, Business, Education, Government ) Region & Country Most Popular New Related Products Specialized attributes Capacity (Battery; Memory; MB; GB; BPS, ) Resolution (DPI; Megapixels; XGA, XGA, UXGA, ) Size (Display; Screen; ...) Standard (a, b, g, n, ; scsi, ata, sata, eide, ; dimm, simm, ) Type (Camera; Battery; Display; Printer; Server; Storage; Switch; )
Taxonomy Strategies LLC The business of organized
Type
Point & Shoot (25) Digital SLR (10) Packages (5)
Price Range
$100-250 (5) $250-500 (16) $500-1000 (19) More than $1000 (3)
19
granularity? Not as many as you think! y Post-coordinate indexing allows several simple controlled vocabularies to be combined, rather than using a single large pre-coordinated vocabulary.
20
nodes each have the same discriminatory power as one hierarchy of 10,000 nodes (104)
Easier to maintain Easier to tag by content authors Can be easier to navigate
Audience
Advocacy Contractors & Grantees Environmental Professionals Federal Facilities General Public Industry Kids Researchers & Scientists Small Business Students
Health
Advisory Exposure Food Safety Health Assessment Health Effect Health Risk Occupational Health Pesticide Effects Sun Protection Toxicity
Industry
Agriculture & Cattle Automobile Repair Chemical Dry Cleaning Electronics & Computer Energy Extractive Industries Food Processing Leather Tanning & Finishing Metal Finishing
Substance
Allergen Biological Contaminant Carcinogen Chemical Explosive Liquid Waste Microorganism Ozone Pesticide Radioactive Waste
the number of facets, than to increase the number of terms per facet.
21
22
y Software does a poor job of De-duplication. Turning significant words and phrases
Sort flickr categories into 5 or fewer groups. Then label each group.
Taxonomy Strategies LLC The business of organized 24
25
Sort flickr categories into 5 or fewer groups. Then label each group.
Taxonomy Strategies LLC The business of organized 26
Todays agenda
9:00-9:10 9:10-9:15 9:15-9:45 9:45-10:00 10:00-10:30 10:30-11:00 11:00-12:00 12:00-12:30 12:30-13:30 13:30-14:30 14:30-14:45 14:45-15:15 15:15-16:15 16:15-16:30 16:30-17:00 10 minIntroduction 5 minWarm-up exercise 30 minTaxonomy fundamentals: Building taxonomies 15 minTaxonomy exercise 30 minTaxonomy fundamentals: Taxonomy business case 30 minTea Break 60 minTaxonomy governance 30 minCapabilities self-assessment 60 minLunch 60 minTaxonomy benchmarking 15 minBenchmarking exercise 30 minTea Break 60 minContent tagging 15 minTagging exercise 30 minQ&A
27
28
framework for organizing the ever-growing and changing information within a company. The many dimensions of taxonomy can greatly facilitate Web site design, content management, and search engineering. If well done, taxonomy will allow for structured Web content, leading to improved information access.
29
to users in some way that cuts costs or improves revenues. y Putting taxonomy into operation requires UI changes and/or backend system changes, as well as data changes. y You need to determine those changes, and their costs, as part of the ROI.
30
31
32
interface for a browsing task, and a direct search interface when they knew precisely what they wanted.
33
Time saved: Taxonomy compared to search 1 hour per day searching x 36% faster = 22 minutes each day
22 minutes x 250 working days per year = 5500 minutes or 92 hours per year
35
Benefit:
36
Sue Feldman,
y Suns usability experts calculated that 21,000 employees
were wasting an average of six minutes per day due to inconsistent intranet navigation structures. When lost time was multiplied by staff salaries, the estimated productivity loss exceeded $10M per yearabout $500 per employee per year.
Jakob Nielsen, useit.com
37
Knowledge workers spend up to 2.5 hours each day looking for information
Communicating
Searching
Creating
But find what they are looking for only 40% of the time.
Source: Kit Sims Taylor
Taxonomy Strategies LLC The business of organized 38
Knowledge workers spend more time re-creating existing content than creating new content
Communicating
Searching
Source: Kit Sims Taylor (cited by Sue Feldman in her original article)
Taxonomy Strategies LLC The business of organized 39
40
being used to facilitate information seeking in the workplace, and on the web. scheme (faceted navigation) when it is made available in the user interface. User Interface.
3. Hierarchical or facet navigation can be guided by the 4. Facet navigation is best combined with keyword
Todays agenda
9:00-9:10 9:10-9:15 9:15-9:45 9:45-10:00 10:00-10:30 10:30-11:00 11:00-12:00 12:00-12:30 12:30-13:30 13:30-14:30 14:30-14:45 14:45-15:15 15:15-16:15 16:15-16:30 16:30-17:00 10 minIntroduction 5 minWarm-up exercise 30 minTaxonomy fundamentals: Building taxonomies 15 minTaxonomy exercise 30 minTaxonomy fundamentals: Taxonomy business case 30 minTea Break 60 minTaxonomy governance 30 minCapabilities self-assessment 60 minLunch 60 minTaxonomy benchmarking 15 minBenchmarking exercise 30 minTea Break 60 minContent tagging 15 minTagging exercise 30 minQ&A
42
to remain relevant. y Maintenance processes need to be specified so that the changes are based on rational cost/benefit decisions.
43
44
Taxonomy Facets
CV Consumers
Site Search Tool
CV Sources
Subject Codes Codes
Project Archives
Working Papers
3: 3: Team adds value to Team adds value via definitions, through snapshots definitions, synonyms, synonyms, classification rules, classification rules, training materials, etc. training materials, etc. Internally Internally Created CVs Created
Web CMS
DMS DAM
CV = Controlled Vocabulary
Taxonomy Strategies LLC The business of organized 45
produced by a cross-functional team which includes business, technical, information management, and content creation stakeholders. y The team should plan on maintaining the taxonomy as well as building it.
Maintenance will not (usually) be anyones full-time job. Exact mix of people on team will change.
46
and consumers of the Taxonomy. Identify new opportunities for use of the Taxonomy across the enterprise to improve information management practices. Promote awareness and use of the Taxonomy.
Taxonomy Strategies LLC The business of organized
47
Keeps committee on track with larger business objectives. Balances cost/benefit issues to decide appropriate levels of effort. Obtains needed resources if those on committee cant accomplish a particular task.
Estimates costs of proposed changes in terms of amount of
data to be retagged, additional storage and processing burden, software changes, etc. Helps obtain data from various systems.
Committees liaison to content creators. Estimates costs of proposed changes in terms of editorial
query logs, indexer feedback. Makes edits to taxonomy, installs into system with aid of IT specialist.
Reality check on process change suggestions.
48
End User
Recommendations by Editor 1. Small taxonomy changes (labels, synonyms) 2. Large taxonomy changes (retagging, application changes) 3. New best bets content.
Taxonomy Editor
Taxonomy Team
3. Retagging cost.
Requests from other Requestsof NASA parts from other parts of the organization
49
change processes.
checks facts with department heads before making changes. Organization 2: Analysts suggest changes, editors approve, copyeditors verify consistency. Organization 3: Marketing reps ask for a change, taxonomy editor makes demo, web representative approves it.
50
Yes
Problem?
No
Problem?
Yes
No
Taxonomy
Analyst
Editor
Copywriter
Sys Admin
51
52
Ability to Execute
low
Completeness of Vision
53
processes around content management, taxonomy, and metadata. y Honestly assess your organizations metadata maturity in order to design appropriate governance processes. y We are starting to define a maturity model, similar to the Software Capability Maturity Model (CMM)
Initial: Ad hoc, each project begins from scratch. Repeatable: Procedures defined and used, but not standardized across
organization or are misapplied to projects. Defined: Standard processes are tailored for project needs. Strategic training for long-range goals is in place. Managed: Projects managed using quantitative quality measures. Process itself is measured and controlled. Optimizing: Continual process improvement. Extremely accurate project estimation.
54
process should be
Overly sophisticated processes will fail.
organizations methods for achieving particular goals. y Mature processes have expenses which must be justified by consequent cost savings or revenue gains. y IT Maturity may not be core to your business.
55
1 X is starting to examine search query logs, which is an important first step in improving search. But this is only an isolated example. 2 IT has a project methodology they are trying to use across all projects. But not all business units have project methodologies.
56
10.
Immature
4 5 6 Mature
10
cost/benefit analysis, or because they are easy to do with the current applications and tools? Cost/Benefit Easy been analyzed, or are major purchases sometimes made to use up year-end money? Requirements Year-End taxonomy positions? Yes No
Functionality
Data Quality
Functionality Basic
People
Data Quality
Yes
No
Advanced
1. Are there established qualitative and quantitative measures of
No
metadata quality?
Yes
No
Intermediate
1. Is there an ongoing data cleansing procedure to look for any
Yes
No
Yes
No
57
Being developed
In practice
Former practice
NA or Unknown
20% (12) 25% (15) 31% (19) 41% (25) 37% (22) 31% (19) 46% (28) 43% (26) 68% (41) 57% (34)
11% (7) 21% (13) 18% (11) 23% (14) 20% (12) 25% (15) 25% (15) 16% (10) 7% (4) 15% (9)
62% (38) 44% (27) 38% (23) 30% (18) 37% (22) 31% (19) 21% (13) 25% (15) 10% (6) 17% (10)
2% (1) 2% (1) 0% (0) 0% (0) 0% (0) 5% (3) 0% (0) 0% (0) 0% (0) 0% (0)
5% (3) 8% (5) 13% (8) 7% (4) 7% (4) 8% (5) 8% (5) 16% (10) 15% (9) 12% (7)
58
22% (13) 37% (22) 52% (30) 52% (31) 48% (29)
12% (7) 37% (22) 16% (9) 20% (12) 20% (12)
37% (22) 20% (12) 21% (12) 17% (10) 20% (12)
48% (29) 57% (34) 15% (9) 38% (23) 48% (29)
15% (9) 17% (10) 12% (7) 18% (11) 18% (11)
17% (10) 17% (10) 61% (36) 27% (16) 17% (10)
59
Being developed
In practice
Former practice
NA or Unknown
36% (21) 37% (22) 28% (16) 20% (12) 32% (19) 75% (44) 47% (28) 35% (21) 28% (17) 38% (23)
10% (6) 10% (6) 21% (12) 36% (21) 29% (17) 3% (2) 22% (13) 17% (10) 22% (13) 40% (24)
34% (20) 32% (19) 40% (23) 34% (20) 34% (20) 14% (8) 20% (12) 40% (24) 33% (20) 13% (8)
5% (3) 5% (3) 5% (3) 3% (2) 0% (0) 0% (0) 0% (0) 2% (1) 3% (2) 0% (0)
15% (9) 15% (9) 7% (4) 7% (4) 5% (3) 8% (5) 10% (6) 7% (4) 13% (8) 8% (5)
60
Todays agenda
9:00-9:10 9:10-9:15 9:15-9:45 9:45-10:00 10:00-10:30 10:30-11:00 11:00-12:00 12:00-12:30 12:30-13:30 13:30-14:30 14:30-14:45 14:45-15:15 15:15-16:15 16:15-16:30 16:30-17:00 10 minIntroduction 5 minWarm-up exercise 30 minTaxonomy fundamentals: Building taxonomies 15 minTaxonomy exercise 30 minTaxonomy fundamentals: Taxonomy business case 30 minTea Break 60 minTaxonomy governance 30 minCapabilities self-assessment 60 minLunch 60 minTaxonomy benchmarking 15 minBenchmarking exercise 30 minTea Break 60 minContent tagging 15 minTagging exercise 30 minQ&A
61
Process
Who
Requires
Rough taxonomy
Validation
Approach Appropriateness to task
Show & explain Taxonomist SME Team Taxonomist Check conformance to editorial rules Users Contextual analysis (card sorting, scenario testing, etc.) Survey Users
Walk-thru
Draft Consistent look and feel taxonomy Editorial Rules Rough taxonomy Tasks & Answers Rough Taxonomy UI Mockup Search prototype Tasks are completed successfully Time to complete task is reduced Reaction to taxonomy Reaction to new interface Reaction to search results
Usability Testing
User Satisfaction
Tagging Samples
Sample Content fit content Fills out content inventory Rough Training materials for people & taxonomy (or algorithms better)
62
Content Type
Award Case Study Contract & Warranty Demo Magazine News & Event Product Information Services Solution Specification Technical Note Tool Training White Paper Other Content Types
Competency
Industry
Service
Product Family
Desktops MP3 Players Monitors Networking Notebooks Printers Projectors Servers Services Storage Televisions Other Brands
Audience
Line of Business
All Home & Home Office Gaming Government, Education & Healthcare Medium & Large Business Small Business
RegionCountry
All Asia-Pacific Canada EMEA Japan Latin America & Caribbean United States
Business & Finance Interpersonal Development IT Professionals Technical Training IT Professionals Training & Certification PC Productivity Personal Computing Proficiency
Banking & Finance Communications E-Business Education Government Healthcare Hospitality Manufacturing Petro-chemicals Retail / Wholesale Technology Transportation Other Industries
Assessment, Design & Implementation Deployment Enterprise Support Client Support Managed Lifecycle Asset Recovery & Recycling Training
All Business Employee Education Gaming Enthusiast Home Investor Job Seeker Media Partner Shopper First Time Experienced Advanced Supplier
63
Abbreviations Ampersands Capitalization General, More, Other Languages & character sets Length limits Multiple parents Plural vs. singular form Scope notes Serial comma Sources of terms Spaces Synonyms & acronyms Term order (Alphabetic or ) Term label order (Direct vs. inverted)
Rule Name
Abbreviations
Editorial Rule
Abbreviations, other than colloquial terms and acronyms, shall not be used in term labels. Example: Public Information NOT: Public Info. The ampersand [&] character shall be used instead of the word and. Example: Licensing & Compliance NOT: Licensing and Compliance Title case capitalization shall be used. Example: Customer Service NOT: CUSTOMER SERVICE NOT: Customer service NOT: customer service
Ampersands
Capitalization
General, The term labels General, More, and More, Other Other shall be used for categories which contain content items that are not further classifiable. Example: Other Property Other Services General Information General Audience
64
Task-based testing*
y 15 representative questions were selected Perspective of various organizational units Most frequent website searches Most frequently accessed website content Correct answers to the questions were agreed in advance by team. y 15 users were tested Did not work for the organization Represented target audiences y Testers were asked where would you look for under which facet Topic, Commodity, or Geography? Then, under which category? Then, under which sub-category? Tester choices were recorded y Testers were asked to think aloud Notes were taken on what they said y Pre- and post questions were asked Tester answers were recorded
Taxonomy Strategies LLC The business of organized 65
1. Topics 1.1 Agricultural Economy 1.2 Agriculture-Related Policy 1.3 Diet, Health & Safety 1.4 Farm Financial Conditions 1.5 Farm Practices & Management 1.6 Food & Agricultural Industries 1.7 Food & Nutrition Assistance 1.8 Natural Resources & Environment 1.9 Rural Economy 1.10 Trade & International Markets
Farm Financial Conditions Costs of Production Commodity Outlook Farm Financial Management & Performance Farm Income Farm Household Financial Well-being Lenders & Financial Markets Taxes
67
User 1
Cotton Cattle Farm Income
User 2
Cotton Food Safety Farm Income Asia Cattle
User 3
User 4
Cotton Cattle Farm Income
User 5
Cotton Cattle Farm Income Diet Quality & Nutrition WIC Program Corn
US States
Food Consumption Diet Quality & Nutrition WIC Program Corn WIC Program Corn
Food Expenditures Diet Quality & Nutrition WIC Program Corn WIC Program Corn
Foodborne Disease Foodborne Disease Consumer Food Safety Food Prices Tobacco Farm Structure Food System Food Security Market Structure Tobacco Farm Structure Labeling Policy Food Security Market Analysis Tobacco Farm Structure Food Safety Innovations Food Security
Foodborne Disease Foodborne Disease Food Expenditures Retailing & Wholesaling Tobacco Farm Structure Tobacco Farm Structure
Food Safety Policy Food Prices Food Security Food Security Commodity Trade
Cropping Practices Conservation PolicyConservation PolicyConservation PolicyConservation Policy Trade Policy Food Safety & Trade WTO Market Analysis Commodity Trade
68
categories that we expected them to look for it. y Breaking-up topics into facets makes it easier to find information, especially information related to commodities.
69
% Correct % Agree
91% 73% 100% 91% 100% 100% 82% 55% 100% 91% 36% 100% 36% 91% 55% 82% 64% 55% 73% 100% 100% 82% 27% 100% 91% 18% 73% 64% 91% 36%
On these trials, only 50% looked in the right category, & only 27-36% agreed on the category.
Possible error in categorization of this question because 64% thought the answer should be Commodity Trade.
70
Topic?
Commodity?
Easy Medium Difficult
Geographic Coverage?
Easy Medium Difficult
71
Easier
Easy
-->
72
general Time performance Calculate success rates Survey subjective criteria Pay attention to survey hygiene:
7.6 6.0
7.2 6.7
7.4 5.5
se U
sy Ea
m Si
e pl
e Fl
e bl xi
s ou i ed T
In
s re te
g tin Ea sy to
w ro B
se
y jo En
e bl a O w er v
m el h
g in
Quantitative test of vocabularies 3-10X numberUse computer-assisted * of categories methods when more than 1020 categories. Pre-existing metadata is the most meaningful.
* Quantitative methods require large amounts of tagged content. This requires specialists, or software, to do tagging. Results may be very different than how real users would categorize content.
Taxonomy Strategies LLC The business of organized 76
Values
Content Types Audiences Organizations Missions & Projects Locations Business Functions Disciplines Time Period
4) Flag questions
Taxonomy Strategies LLC The business of organized 78
y Taxonomy
Converted MultiTes format into RDF for Seamark
y Metadata
Converted from existing metadata on web pages, or Created using simple automatic classifier (string matching with
terms & synonyms) 250k items, ~12 metadata fields, 1.5 weeks effort
79
80
Ex hi b
Bi
Ju ve ni le
lit
81
search index were manually categorized. Inaccessible files and junk were removed.
y Results: Slightly more uniform than Zipf distribution. Above the curve
Content Type
82
Methodology:
y 25,380 resources tagged with
taxonomy of 179 terms. (Avg. of 2 terms per resource) y Counts of terms and documents summed within taxonomy hierarchy
Results:
y Roughly Zipf distributed (top 20
terms: 79%; top 30 terms: 87%) y Mismatches between term% and document% flagged
Taxonomy Strategies LLC The business of organized
10 pre-defined categories Categories where fewer than 14 of 15 testers put product into same category were flagged
84
85
87
Query log & click trail examination Who are the users & what are they looking for?
y Only 30-40% of organizations regularly examine their
logs*. y Sophisticated software available, but dont wait. y 80% of value comes from basic reports
88
summary
89
90
91
Benchmarking exercise
y What are 5 representative questions that your users ask or tasks
that your users do when using your application? y Is it currently easy, medium or difficult to answer these questions or accomplish these tasks?
Questions or Tasks
92
enables owners, and engages stakeholders. Quick implementation that provides measurable results as quickly as possible. A means to an end, and not the end in itself. Not perfect, but it does the job it is supposed to dosuch as improving search and navigation. Improved over time, and maintained.
93
Todays agenda
9:00-9:10 9:10-9:15 9:15-9:45 9:45-10:00 10:00-10:30 10:30-11:00 11:00-12:00 12:00-12:30 12:30-13:30 13:30-14:30 14:30-14:45 14:45-15:15 15:15-16:15 16:15-16:30 16:30-17:00 10 minIntroduction 5 minWarm-up exercise 30 minTaxonomy fundamentals: Building taxonomies 15 minTaxonomy exercise 30 minTaxonomy fundamentals: Taxonomy business case 30 minTea Break 60 minTaxonomy governance 30 minCapabilities self-assessment 60 minLunch 60 minTaxonomy benchmarking 15 minBenchmarking exercise 30 minTea Break 60 minContent tagging 15 minTagging exercise 30 minQ&A
94
Tagging Overview
y Tagging is better than the words that happen to occur in a
time the content is handled and used so that it accrues value or its significance is diminished.
95
Ho wm any
peo ple f
ill t his
in?
96
Organize
Ho wm an yp eo p le
cli ck on thi s?
97
y Emergent categories
Resembles open card sort process in which patterns emerge rather than validating categories using closed card sorts.
98
y del.icio.us founder
Joshua Schachter
y del.icio.us & flickr are now both part of Yahoo! y As of April 2006 flickr had 130 million photos posted by 3
99
Use specific terms Apply the most specific terms when tagging content. But do not tag every possible topic, just the ones that are most important or best characterize the content as a whole. Use multiple terms Use as many terms as necessary to describe overall What the content is about & Why it is important. Do not over-tag. Use appropriate terms Consider how content will be used Only fill-in the facets & values that make sense. Not all facets apply to all content. Anticipate how the content will be searched for in the future, & how to make it easy to find it. Remember that search engines can only operate on explicit information.
100
Agenda
y Content Tagging y Tagging Interface
101
others) Controlled vocabularies, e.g., with pull-down list Multi-valued tags Geo-tagging Group tagging Clean-up tag tools, e.g., alpha list Batch editing Share/Dont share (Public/Private) Identified owner (who can be emailed) Almost immediate feedback, e.g., tag cloud
102
103
104
105
106
107
Group geo-tagging
108
Group geo-tagging
109
110
Batch edit
111
112
Bulk tagging
y ID collection of related content items by pattern or context y Then, apply same attributes to all content items
113
Tag a folder
y Drag & drop content items into folder y Then, content items inherit properties of folder
114
Workflow
y Approve & improve mindset
Publish
115
Interactive rewards
y Almost instantaneous exposure of tags in simple user
interfaces on the web provides positive reinforcement for user tagging that simply did not exist before. y For example,
Most popular Tag clouds Alerts
116
Most popular
Times.
117
Tag cloud
118
Alerts
y New (content selected by date) y Subscriptions (content selected by tags) y Interest (content selected by other people) y Individual (content selected for you by other people)
119
Taxonomy Strategies
LLC
HBSP. http://discussionleader.hbsp.com/davenport/2007/04/cause_and_effect_reporting_raw.html#comments
Taxonomy Strategies LLC The business of organized 122
Africa Americas Antarctica Asia Europe Oceania Global Historical geography Oceans & seas Regions
Agriculture Mining Utilities Construction Manufacturing Wholesale trade Retail trade Transportation & warehousing Information Finance & insurance Real estate Professional Management Administrative support Education Health care Arts, entertainment & recreation Accommodation & food Other services Public administration
Business entities Companies & brands Government agencies International NGOs Organization types
Basic facts & information Blog Brochure Database E-mail Letter Memo Multimedia Report Newsletter Podcast Press Release Research & Analysis RSS Feed
Taxonomy Facets Business activity Industry / Product Geography Organization Person / Role Content Type Audience
Tags
Topic
124
Summary
y There are lessons to be learned from web tagging about
how to get good metadata in document and content management applications. y Document and content management system tagging must be simple, and it must be almost instantaneously easier to find relevant work products.
125
Taxonomy Strategies
LLC
Questions?
Joseph A. Busch + 415-377-7912 jbusch@taxonomystrategies.com http://www.taxonomystrategies.com