Given a Graph, where we know

  • Total number of nodes (~100,000)
  • Average no of connections per node (~200)
  • Maximum distance between two nodes (~5)

How many nodes (and its connections) do we have to sample so that for 20% of node-pairs, we know the exact connection (any path or shortest path or a path less than 5 nodes away)? My idea right now is to simulate this. Is there anything better I can do?

To give a relevant example of possible application, let’s say, when you visit a LinkedIn profile, LinkedIn wants to suggest a person whom you can ask for introduction, a second or third or fourth degree contact who is a common contact for both of you.

After LinkedIn launches in India, how many people (out of ~100,000 working professionals in India) do they need on their platform (with all of their ~200 their contacts) before they can start doing this for 20% of profile visits?

Please note that I am in no way related to LinkedIn :)

Here is a related question (still unanswered): https://math.stackexchange.com/questions/2709294/what-is-probability-that-two-persons-are-befriended-in-a-social-network

New contributor
Soumendra is a new contributor to this site. Take care in asking for clarification, commenting, and answering. Check out our Code of Conduct.

Your Answer

Soumendra is a new contributor. Be nice, and check out our Code of Conduct.
 
discard

By clicking "Post Your Answer", you acknowledge that you have read our updated terms of service, privacy policy and cookie policy, and that your continued use of the website is subject to these policies.

Browse other questions tagged or ask your own question.