This is a fantastic question and one which nobody can fully answer yet. Generally speaking, questions to do with taste i.e. why we like one thing and not another, are still in their infancy in terms of getting good answers. Nevertheless, small steps have been made to answering these questions. Here's what we do know:
Around 4 months into pregnancy, the ears are mostly developed and sound can be detected. The foetus will most oven hear the sound of it's mother's voice therefore and will likely find this type of sound (and any voice similar) will unconciously remind you of the most familiar (and therefore safe) sound you heard during development. There's a wealth of studies indicating that newborn babies respond more favourably to the sound of a mother's voice than to other voices, most recently in a 2012 study by Lesley Peltzer where it was found that girls engaged in stressful activities (the study involved solving math questions in front of an audience) the girls were calmed down better by hearing their mother speak over the phone, than by reading the same message sent from their mother in a text.
Similarly, a 2010 study carried out at the University of Minnesota found that students who played instruments tended to respond better to music that had similar frequencies to their instrument. In other words, our fondness for sound does seem to have a strong cultural influence. We tend to like sounds and music we're familiar with.
However, it's also worth noting that animals often have specific noises that are used to signify danger, food and a desire to breed. These calls are often the same, or similar, for different animals of the same species in different areas. Meaning there does also seem to be some evolutionary bias to like some sounds and not others. Most obviously, we don't like noises which gradually get louder and louder because it will give the impression of something approaching us (which immediately sets our instincts on alert for a predator). So ultimately, it's probably going to be a mixture of evolutionary preference for certain sounds, mixed in with what we were exposed to when we were young.