Knowledge rich The other extreme Web log mining User sessions Markov models But, sometimes data about specific pages are sparse Cannot train the markov models properly a single visitor views w0% of any site New dynamic content not in training data Now, many pages are generated automatically Deep web Dynamically generated pages Question: if we have the relational knowledge, what more can we do? 2021/1/26 12
2021/1/26 12 Knowledge Rich: The other extreme ◼ Web log mining ◼ User sessions ◼ Markov models ◼ But, sometimes data about specific pages are sparse! ◼ Cannot train the Markov models properly ◼ A single visitor views ~0% of any site ◼ New & dynamic content not in training data ◼ Now, many pages are generated automatically ◼ Deep web ◼ Dynamically generated pages ◼ Question: if we have the relational knowledge, what more can we do?
Relational markov models RMM( Relational What relation means Markov model Buys(Student, PC) Group the same type Student(ID, Name pages into relations Addr) Combine low-level and Relational Algebra is high-level information the basis for Automatically adapt relational databases Web sites for different users 2021/1/26 Dan Weld 13
2021/1/26 Dan Weld 13 Relational Markov Models RMM ( Relational Markov Model ) ◼ Group the same type pages into relations. ◼ Combine low-level and high-level information ◼ Automatically adapt web sites for different users ◼ What relation means ◼ Buys(Student, PC) ◼ Student(ID, Name, Addr) ◼ Relational Algebra is the basis for relational databases