So I spent a good amount of time trying to find the answer on how to do this. The only answer I have found so far is here: How to perform clustering without removing rows where NA is present in R
Unfortunately, this is not working for me.
So here is an example of my data (d in this example):
Q9Y6X2 NA -6.350055943 -5.78314068
Q9Y6X3 NA NA -5.78314068
Q9Y6X6 0.831273549 4.875151493 0.78671493
Q9Y6Y8 4.831273549 0.457298979 5.59406985
Q9Y6Z4 4.831273549 4.875151493 NA
Here is what I tried:
> dist <- daisy(d,metric = "gower")
> hc <- hclust(dist)
Error in hclust(dist) : NA/NaN/Inf in foreign function call (arg 11)
From my understanding daisy should be able to handle NA values, but I am still receiving an error when trying to cluster my results.
Thanks.