Write a pseudo code. The quest

Facebook Interview Question for Software Engineer / Developers

0

of 4 votes

14
Answers
Write a pseudo code. The question aims to test both your programming and analytical skills. Your implementation will be tested for both time and space efficiency.

The input format is strictly followed. Your program will be evaluated for correctness against multiple inputs. Some inputs will be very large (> 100,000 nodes).

Problem Statement

You are in-charge of the office jukebox. You are determined to do a very good job and make your colleagues happy. You ask them to email you a list of music bands they like. The number of bands each colleague likes is limited to 10,000.

Input

All input will be given on stdin. Your input will be of the form,

1. The first line will be an integer stating the number of lines of input.
2. The input will only contain alphanumeric characters, colon, comma, underscore and space [A-Z, a-z, 0-9, _, , ], +.
3. The first word will be the name of your colleague, followed by a colon.
4. A comma separated list of that person’s favourite bands will follow the colon.
5. Every line will be terminated by a newline character (\n).

An example input would look like:

6
Anne: Metallica, The_Doors, Black_Sabbath
John: The_Beatles, The_Doors, Metallica, Pink_Floyd
Kathy: U2, Guns_n_Roses, Led_Zeppelin
Jamie: Radiohead
Ashok: Guns_n_Roses, U2, Pink_Floyd, The_Doors
Sara: Blink_182, Iron_Maiden, The_Doors

Problem

You decide to use your data to find the people most compatible with each other. Two people are compatible if they have at least 2 bands in common. The compatibility of two people is directly proportional to the number of bands they like in common.

For each person in the list, output the most compatible person(s). If there is more than one compatible person, separate the names with a comma. If a person has nobody compatible, output nothing. For our example input, the output will be,

Anne: John
John: Anne, Ashok
Kathy: Ashok
Jamie:
Ashok: John, Kathy
Sara:
- br312@live.co.uk July 04, 2012 in United States | Report Duplicate | Flag | PURGE
Facebook Software Engineer / Developer Coding

Email me when people comment.

An error occurred in subscribing you.

Country: United States
Interview Type: In-Person

Email me when people comment.

An error occurred in subscribing you.

Comment hidden because of low score. Click to expand.

of 2 vote

PersonRequest
{
PersonName
SongList
}

List<PersonRequest> is input
Dictionary<SongName, List<PersonId>> songDict = new Dictionary
List<PersonName> PersonNameList
PersonId = 0
Foreach PersonRequest in input
{
	PersonNameList.Add(PersonRequest.PersonName)
	Foreach Song in PersonRequest.SongList
	{
		if(!songDict.Contains(Song))
			songDict.Add(Song, new List<PersonId>())
		songDict[Song].PersonList.Add(PersonId)	
	}
	PersonId++
}

// Probable candidate list
Dictionary<PersonId, Dictionary<PersonId, Count>> CandidateList;
Foreach Song in SongDict
{
	if(Song.PersonList.Count > 1)
	{
		Foreach PersonId in Song.PersonLit
		{
			if(!CandidateList.Contains(PersonId)
				CandidateList.Add(PersonId, new Dictionary<PersonId, Count>())
			ForEach SecondLevelPersonId in Song.PersonList
			{
				If(PersonId <> SecondLevelPersonId)
				{
					If(!CandidateList[PersonId].Contains(SecondLevelPersonId)
						CandidateList[PersonId].Add([SecondLevelPersonId], 0)
					CandidateList[PersonId][SecondLevelPersonId].Count++
				}
			}	
		}
	}
}

Foreach Candidate in CandidateList
{
	bool personHasCompatible = false
	strincg userToPrint = Candidate.name
	Foreach PersonCounts in Candidate
	{
		if(PersonCounts.Count >=2)
		{	
			personHasCompatible = true
			userToPrint.append(' ' + PersonCounts.Name)
		}
	}
	if(personHasCompatible)
		Print userToPrint
}

- swamy540@yahoo.co.uk July 04, 2012 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

I'd love it if you have also summarized your logic in algo form.

- sanjay.kr.maurya July 05, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

I think it is good , just PersonNameList is not being used anywhere

- pc July 06, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 1 vote

A simple way to do this is to maintain a hashMap of Map<String,Set<String>> where the key is the band name and the set of strings is basically the number of employees who have requested this band. This way we can easily invalidate those bands whose value count is lesser than 2 as nobody else likes this band.

Now with this reduced subset, we can convert it to a graph problem where the verticies are the music bands who have more than 1 recommendations and the edges are the employees. Now this is reduced to a spanning tree problem which can be efficiently solved

Space wise we have just a hashmap with the number of entries equal to the number of bands in this case the worst case scenario would be the case where all the 10000 bands requested by each employee will be different in which case we'd have n * 10000 entries where n is the number of employees. On the flip side there would be no processing required here as the reference count for each value would just be one.

There are many efficient ways to solve the graph problem when we need to process the data.

- keshi July 05, 2012 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

This is interesting to use a graph.

Can you explain why the vertices are bands and the edges are people? To me, it seems like the vertices should be people and the edges be bands, since we're interested in the connectivity between people.

But maybe this is just because I don't know much about spanning trees.

- vmayer99 July 06, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

i do also feels employees should be on vertices and edges should be band.

- Shiv July 06, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

what is the basis of turning this into minimum spanning tree problem?

- nutcracker July 06, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

not a minimum spanning tree
it will be just printing names of people on nodes which have 2 or more edges to current person

- Tushar July 07, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

using Hashing method , store all the band(key) of each colleuges and initialize the value with 0.
Whnever you are getting the given band name ,increment the value of the band name.
That band name whihc is getting the more number of value,that bands will be selected .

- aaa July 04, 2012 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

FP-Tree

- Anonymous October 24, 2012 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

is thre any standard algorithm for this problem???

- annonymous July 19, 2012 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

struct HashIntPair
{
size_t operator()(const pair<int,int>& intPair) const
{
std::hash<int> hasher;
return hasher(intPair.first) + 13*hasher(intPair.second);
}
};

int _tmain(int argc, _TCHAR* argv[])
{
ifstream ifs ("c:\\src\\example.txt");
if (!ifs.is_open())
{
return -1;
}

vector<string> UserNames;
unordered_map<string, vector<int>> AlbumToUsersMap;
unordered_map<pair<int, int>, int, HashIntPair> RelatedScoreMap;
unordered_map<int, vector<int>> RelatedUsers;

// process input
string temp;
getline(ifs, temp);
int numUsers = 0;
stringstream(temp)>> numUsers;
for (int i = 0; i < numUsers; ++i)
{
getline(ifs, temp);
stringstream liness(temp);
string name;
getline(liness, name, ':');
UserNames.push_back(name);
// read music
string album;
while(getline(liness, album, ','))
{
AlbumToUsersMap[album].push_back(i);
}
}
ifs.close();

// updated related scores
for (unordered_map<string, vector<int>>::iterator at = AlbumToUsersMap.begin(); at != AlbumToUsersMap.end(); ++at)
{
for (vector<int>::iterator it= at->second.begin(); it != at->second.end(); ++it)
{
for (vector<int>::iterator jt = it+1; jt != at->second.end(); ++jt)
{
pair<int,int> related = *it < *jt ? make_pair<int, int>(*it, *jt) : make_pair<int, int>(*jt, *it);

unordered_map<pair<int, int>, int>::iterator rtscore = RelatedScoreMap.find(related);
if (rtscore == RelatedScoreMap.end())
{
RelatedScoreMap[related] = 1;
}
else
{
++ (rtscore->second);
RelatedUsers[*it].push_back(*jt);
RelatedUsers[*jt].push_back(*it);
}
}
}
}

// output
ofstream ofs ("c:\\src\\exampleout.txt");
for (size_t i = 0; i < UserNames.size(); ++i)
{
ofs << UserNames[i] << " : ";
for (vector<int>::iterator it = RelatedUsers[i].begin(); it != RelatedUsers[i].end(); ++it)
{
if (it != RelatedUsers[i].begin()) { ofs << " , " ; }
ofs << UserNames[*it];
}
ofs << endl;
}

ofs.close();
return 0;
}

- AravindR July 29, 2012 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

this problem could be done in O(length of file) as follows:
maintain a trie and put the name of the bands into it. when finished putting a band into the trie, add the corresponding college name into the last trie node's college list.
each time you put a new band into the trie, you either found that this is a new band, or this band also belongs to some other college. so if you maintain a hash table for each college, you can find the required pairs in O(1).

- lambda2fei August 02, 2012 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

Go through the input once, build a hash table with the keys being the band name, values being a list of colleagues

Now go through the input again, for each band mentioned by each colleague, create a new hash table with key being other colleagues, value being the number of times they appear in the bands that the current colleague likes.

Now for each colleague, we just need to go through the new hash table and find the entry with the maximum value

Code can be tested here: ideone.com/4bgU7
One caveat is that the order of the compatibility list is not preserved (although it wasn't mentioned in the problem statement, but I can see that if there's a tie, output in the order same as the input colleague list).

import sys
 
def parseLine(line):
  parts = line.split(':')
  name = parts[0]
  bands = parts[1].rstrip().split(',')
  return name, bands
 
num_lines = int(sys.stdin.readline())
 
names = [] # to preserve the original order
colleague_likes = {}for line in sys.stdin:
  name, bands = parseLine(line)
  colleague_likes[name] = bands
  names.append(name)
 
band_likes = {}               
for item in colleague_likes.items():
  name = item[0]
  bands = item[1]
  for band in bands:
    if band_likes.has_key(band):
      band_likes[band].append(name)
    else:
      band_likes[band] = [name]
 
for name in names:
  bands = colleague_likes[name]
  compatibility = {}
  for band in bands:
    for colleague in band_likes[band]:
      if colleague == name:
        continue
      else:
        if compatibility.has_key(colleague):
          compatibility[colleague] = compatibility[colleague] + 1
        else:
          compatibility[colleague] = 1
     
  max_compatibility = 0
  compatibility_list = []
  for item in compatibility.items():               
    if item[1] < 2:
      continue
    if item[1] > max_compatibility:
      compatibility_list = [item[0]]
      max_compatibility = item[1]
    elif item[1] == max_compatibility:
      compatibility_list.append(item[0])
  print '%s: %s' % (name, ', '.join(compatibility_list))

- airfang613 August 05, 2012 | Flag Reply

CareerCup

Facebook Interview Question for Software Engineer / Developers

Books

Videos

Resume Review

Mock Interviews