SAS/R calculate distance between two groups
I would like to calculate distance between two groups. I am very confused.
I have a two data sets. One is about a company and one is about employees.
I would like to find out how their age( a company in which an employee is hired and an employee) are similar or not. I think I need to standarize also..
- calcuate euclidean distance between each person and a company. (4-5 people in a company)
- calculate euclidean distance between each person and a company in industry level.
My dataset is like this:
person person_age company company_age industry 1 50 1 5 1 2 40 1 5 1 3 30 2 1 1 4 20 2 1 1 5 25 3 8 2
The following code will reproduce my data.frame:
person <- 1:5 person_age <- c(50,40,30,20,25) company <- c(1,1,2,2,3) company_age <- c(5,5,1,1,8) industry <- c(1,1,1,1,2) myData <- data.frame(person, person_age, company, company_age, industry)
Please help me. I don't mind using SAS or R. I am very confused.
So if you just want Euclidean distance on each person's age you can do something like:
d <- dist(myData[c("person","person_age")])
Your question is still incredibly malformed. For example, what does "calcuate [sic] euclidean distance between each person and a company" mean?
If you just want to know the distance between each person and the company they work for, you could code that as:
with(myData, ((person_age - company_age)^2)^.5 )
which is simply the 2d Euclidean distance formula