An evolutionary tree doesn’t add up
Starr, who joined Bloom’s lab to study HIV and quickly pivoted to SARS-CoV-2 when the COVID-19 pandemic struck, was inspired to study the origins of ACE2 binding in SARS-related coronaviruses (known as sarbecoviruses) after researching the hunt for the origins of SARS-CoV-1, which caused an epidemic in Asia in 2003.
“It was a 10-year process, and there was a debate the whole time,” he said.
Although many bat viruses with genetic similarity to SARS-CoV-1 were found across China during this time, they were genetically distinct, and none of these viruses were found to use ACE2 proteins as their entry receptor.
It wasn’t until 2013 that researchers announced they’d found a related bat coronavirus whose key fit the lock of the human ACE2, marking these bat viruses as a proximal source of the SARS-CoV-1 epidemic. Scientists worried about future spillovers intensified their searches for more ACE2-binding bat coronaviruses, which continued to be found in just a single province in Southern China, even though sarbecoviruses are found throughout the world.
Many bat sarbecoviruses in Asia, Europe, and Africa did not appear to use ACE2 as their entry point, leading researchers to conclude that it was a trait that had appeared late in coronavirus evolution.
But when Starr studied the tree of sarbecovirus evolution, he wasn’t convinced.
“SARS-CoV-2 sprung from a branch where we weren’t looking [for a new spillover event],” Starr said.
He suspected that the hyper-focus on ACE2-binding bat coronaviruses in one region of the world had blinded scientists to the origins of ACE2 binding in coronaviruses.
“It seemed like there was potential that using ACE2 is a trait writ large,” he said. “Using ACE2 in general could be a more widespread trait than was previously appreciated.”
What if the coronaviruses that didn’t use ACE2 to infect were the exception, instead of the rule?
ACE2 binding is an ancient characteristic
The region of the coronavirus spike protein that interacts with ACE2 is called the receptor-binding domain, or RBD. To assess whether ACE2 binding is a new or old characteristic, Starr amassed 45 RBD genes from across the four known, closely related subgroups of RBDs across the SARS-related coronavirus family. These subgroups, also known as clades, describe related RBDs that can be traced to the same branch point off the sarbecovirus evolutionary tree. SARS-Cov-2 and SARS-CoV-1 RBDs fall into different clades with closely related bat viruses, both in Asia. Starr’s survey included two RBDs from a clade of sarbecoviruses circulating in bats in Europe and Africa, which diverged from Asian sarbecoviruses hundreds or thousands of years ago.
Then, using a high-throughput system in which he enlists yeast to display just the RBD segment of the spike protein, Starr screened each RBD’s ability to bind ACE2 receptors from different host species, including human, civet, pangolin, mouse and two species of bat found in China.
As expected, nearly all RBDs in SARS-CoV-2 and SARS-CoV-1’s clades bound each species' ACE2 to a greater or lesser degree. But RBDs from the third clade of Asian sarbecoviruses didn’t bind any ACE2 from any species Starr tested, as had previously been suspected for this clade. And, in a first for any sarbecovirus found outside Asia, Starr saw that a bat virus from Kenya bound to two bat ACE2s.
That virus is from “a distinct clade on the evolutionary tree, and obviously, geographically, that's a much larger range that a single province in China, or a larger region in Southeast Asia, where ACE2 binding as a general property was thought to emerge,” Starr said.
Collaborating with University of Washington biochemist Dr. David Veesler, Starr confirmed that the Kenyan virus does use bat ACE2 to enter cells.
“This suggests that in Africa and Europe, viruses are probably also using ACE2, and it's this one clade in Southeast Asia that lost ACE2 binding that's so heavily sampled that is actually the outlier,” Starr said.
But because of how distantly related the Kenyan virus is to the Asian sarbecoviruses that scientists have focused on, Starr expects that many related but yet-to-be tested sarbecoviruses also use ACE2 as their entry point.
Though human ACE2 binding isn’t the only requirement for a sarbecovirus to trigger a pandemic, Starr’s findings suggests that scientists should change the geographic breadth of viral surveillance and perform wider, more careful sampling to monitor sarbecovirus spillover potential, he said.
Starr said that his study also can inform work on therapeutics and vaccines. To guard against a future spillover, researchers are working to design pan-sarbecovirus vaccines and therapeutic antibodies. But concentrating too much on SARS-CoV-1 and -2 and ignoring distantly related sarbecoviruses could leave us vulnerable to spillover events that may happen elsewhere on the sarbecovirus family tree, he said.
Starr used computational methods to reconstruct the ancestral RBD that existed before Asian and non-Asian sarbecovirus lineages diverged, and found that the ancestor also bound one of the bat ACE2 variants. According to Starr’s reconstruction, broader ACE2 binding arose in the ancestors of Asian sarbecoviruses as they diverged from the European and African viruses, then was lost in one clade as its ancestors diverged again from those that would give rise to SARS-Cov-1 and -2.