8:00 – 2:30 SR
- I’m thinking about how the edges in a network can have certain characteristics
- Bandwidth – the capacity of the edge. Can be expressed as used and potential
- Frequency – how often the edge is used. Bandwidth provides a ceiling on this
- Richness – I’m not sure how to think about this. In the most basic case, this could be an expression of information content (i.e. an infrequent transmission with lots of content is equivalent to a frequent transmission with low content). But when we layer on context and meaning, all kinds of additional information gets ‘compressed’ into the message – a glance can be nothing or everything. Ideally, the description of the edge should contain some way of accessing that context, and shouldn’t be considered simply a link
- Along this lines, I found this paper: Lexical chains as representations of context for the detection and correction of malapropisms. Lexical chains are certainly one way of representing context.
- Another way of thinking about richness is in terms of similarity. If the link is carrying the same payload over and over again, then there isn’t much richness. So can we look at similarity as a way of determining how rich a link is? An Information-Theoretic Definition of Similarity. And here’s looking at news headlines, which might be relevant to short posts or tweets: Similarity for news recommender systems
- And this reminded me of Princeton’s WordNet, which could be really helpful.
- Ok, back to the database
- Based on the above thoughts, I’m adding assoc_type to tn_associations
- Here’s a query that pulls all the parts together. According to everything I’ve read, joins are probably the most efficient way to do this (size and speed):
select u.login as `Link Created By`, a.created_on as `Created On`, at.name as `Assoc Type`, si.text as Source, st.name as `Source Type`, ti.text as Target, tt.name as `Target Type` from tn_associations a inner join tn_users u on a.user_id=u.uid inner join tn_types at on a.assoc_type = at.uid inner join tn_items si on a.source_id = si.uid inner join tn_types st on si.item_type = st.uid inner join tn_items ti on a.target_id = ti.uid inner join tn_types tt on ti.item_type = tt.uid;
- Now I need to load and save a named network from a particular user or any variants up to and including all networks from all users, without redundant nodes/edges…