Image source, Getty Images

  • Author, David Cox
  • Role, BBC News

Since the Human Genome Project was completed in 2003, scientists have been trying to locate new regions among the 3 billion letters of our genetic code that may play a critical role in disease.

With the help of technologies that allow whole-genome samples to be analyzed faster and cheaper than ever before, a large number of genome-wide association studies (GWAS) have been published that identify genetic variants linked to different chronic diseases.

For many geneticists, this has proven to be the easy part. The hard part has been understanding their relevance. For example, although GWAS have identified DNA segments associated with inflammatory bowel disease at 215 different chromosomal sites, scientists have only been able to determine the exact mechanisms involved at four of them.

One of the biggest challenges is that many of these DNA fragments are found in so-called genetic deserts.strips of the genome that seemed to contain nothing relevant, genetic “junk” that could be discarded. Less than 2% of the human genome is devoted to coding for genes that produce proteins, while much of the remaining 98% has no obvious meaning or purpose.