Professional Documents
Culture Documents
Admin
vim: (1, 3)
only: (1)
real: (1)
editor: (1, 4)
why: (2)
do: (2)
people: (2)
use: (2)
emacs: (2, 3)
evil: (3)
mode: (3)
nano: (4)
best: (4)
best: (4)
do: (2)
editor: (1, 4)
emacs: (2, 3)
evil: (3)
mode: (3)
nano: (4)
only: (1)
people: (2)
real: (1)
use: (2)
vim: (1, 3)
why: (2)
best: (4)
do: (2)
editor: (1, 4)
emacs: (2, 3)
evil: (3)
mode: (3)
nano: (4)
only: (1)
people: (2)
real: (1)
use: (2)
vim: (1, 3)
why: (2)
best: (4)
do: (2)
editor: (1, 4)
emacs: (2, 3)
evil: (3)
mode: (3)
nano: (4)
only: (1)
people: (2)
real: (1)
use: (2)
vim: (1, 3)
why: (2)
best: (4)
do: (2)
editor: (1, 4)
emacs: (2, 3)
evil: (3)
mode: (3)
nano: (4)
only: (1)
people: (2)
real: (1)
use: (2)
vim: (1, 3)
why: (2)
Q4: HITS
What does HITS stand for?
Q4: HITS
What does HITS stand for?
Hyperlinked-Induced Topic Search
How does the algorithm work?
It starts with the user's query to create the root set. It then builds the base set
from those pages. Once you have this focused subgraph, run the algorithm to
compute hub and auth scores
Q4: HITS
What are hubs and authorities?
Q4: HITS
What are hubs and authorities?
Hubs are central repositories - they have links to good authorities
Authorities are the sources of information - they are linked to by good hubs
Q4: HITS
How does HITS differ from PageRank?
Q4: HITS
How does HITS differ from PageRank?
HITS is based on the users query.
Each node maintains two scores - hub and auth
Each round requires an explicit normalization step
Q5: Pagerank
What does the .85 value for d represent?
If we assume most internet users are mobile, should we raise or lower the value of
d?
Q5: Pagerank
What does the .85 value for d represent?
This value represents the amount of time that a user clicks on a link. So, 85% of
the time they follow by clicking links, and 15% of the time they navigate to a new
page.
If we assume most internet users are mobile, should we raise or lower the value of
d?
Probably raise. Mobile users are more likely to follow links, and less likely to
navigate to new pages (because navigating to new pages requires typing in a
URL)
Q5: Pagerank
What are some issues with PageRank as a metric of page quality?
Q5: Pagerank
What are some issues with PageRank as a metric of page quality?
- Link farms, spam bots, etc. can skew rankings
- Links may not be meant as an endorsement, ie. social media shares
- Ajax and javascript can make traditional surfing difficult
- Content can be behind login - facebook feed is not searchable
A
E
C = .2
D = .2
C
Assume d=.85
E = .2
A
E
D = .15/5 + .85*(.2/2+.2/2) = .2
C
Assume d=.85
A
E
Assume d=.85