Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Page-2
Page-2
Page-3
Page-3
SMART/InSight History
2 of top 3 Japanese car manufacturers Top consumer electronics company Large nancial ins8tu8ons Chinas biggest eCommerce rm
2005: SMART InSight 1.1 2004: PlaRorm for custom solu:ons 2003: FAST Alliance
Page-4
Page-4
Smart Phone
Page-5
Page-5
Page-6
Page-6
Driver of innova/on
Power shift
Page-7
Page-7
Smaller record and index size enable faster index maintenance # of records per node: rule of thumb 10m vs. 2m Licensing & Maintenance Cost: less than
Scalability: 5x Cost Performance: 10x High Flexibility Lower Opera/ons Cost Faster Innova/on
Page-8
Enterprise Search expecta/ons Big data scale Security is important Disparate data: geography, systems, languages, format, structures KM is good to have, databases are cri:cal Support dierent users & usage: department, role, tasks High recall
Page-9
Page-9
Security
ACL security: complex requirements File System: le & folder level control CRM/ERP : Keeping ACLs up-to-date
Content
aggrega/on
Openpipeline Pypes
Page-10
Building
specialized
applica/ons:
Content
fusion
Content
fusion
from
disparate
data:
Page-11
Page-12
Append Pipeline
Tagging Pipeline
Data transforma:on: -key:key, key:value, eld names Query & Result transforma:on Boos:ng / Relevancy algorithm Security Mul:-Language support Federa:on & mashups
. . . . . .
Boos:ng
Transform
LWE Adapter
SolrAdapter
Other
Solr
Result Pipeline
Query Pipeline
Search Service
Content Security
Page-13
Schema independent widgets for analy:cs & visualiza:on Portalized Personalized: widgets, func:ons, content, elds
Page-14
Knowledge Centre has logs of all user ac:vity in SMART InSight This would be too costly with a commercial Search Engine and would not be feasible in a database
Prole users, groups and networks Personalize Recommenda:ons Create social ranking algorithms Usage analy:cs
Page-15
Ajax Portal
Personaliza/on
Benchmarking
Data Chain
Parts Catalog
Content Model
Claim Analysis
Page-16
>
$50
Billion
sales
/
year
>
800
Million
Items
>
370
Million
Users
Billions
of
clicks
per
day
Access
Log
Solr Hadoop
xxxxxxxx
Page-17
Broadcast Search
> 270 content sources: Socie/es, Associa/ons, Publishers & Open IEEE, ACM Elsevier, Wiley, Springer
Demonstra/on
Page-19
Contact Details
Page-20