Introduction

Hereditary ophthalmic diseases such as retinal dystrophies, congenital cataract, and other developmental malformations of the eye have considerable impact on social autonomy and quality of life. In particular, infants with congenital visual impairment show significant delays and difficulties regarding their motor, cognitive, language, educational, social, and emotional development compared with sighted children [1]. Molecular testing can help to provide a patient with early diagnosis and potential treatment, and is further required for accurate genetic counseling. Detection of disease-causing variants in genetic diagnostic and research testing has significantly improved with the introduction of next-generation-sequencing technologies (NGS), especially for rare diseases with a high level of clinical and genetic heterogeneity [2].

Within the last two decades, major progress has been made in our understanding of hereditary ophthalmic diseases. Inherited retinal dystrophies (IRDs), a major cause of childhood vision loss, are a heterogenic group of blinding diseases with more than 225 genes known to be mutated in the various clinical subtypes (https://sph.uth.edu/retnet/). Besides the genetic complexity, phenotypic overlaps and considerable variation in clinical expression may hamper a distinct clinical diagnosis [3]. The different disease entities result from variants in genes critical for development and maintenance of retinal photoreceptors or their supporting cells (for review, see ref. [4]). IRDs can be stationary or progressive (for review, see ref. [5]), and in the latter, vision worsens over time usually to the point of legal blindness, which can occur in early infancy (i.e. Leber congenital amaurosis) or later in life (i.e. retinitis pigmentosa or cone–rod dystrophy).

Congenital eye malformations result from defects in genes expressed during normal embryonic development, and are a significant cause of blindness worldwide [6]. Such developmental ocular defects comprise congenital cataract, congenital corneal anomalies (such as microcornea and sclerocornea), coloboma, microphthalmia (including nanophthalmos), anophthalmia and others, invariably leading to poor or no visual function and cosmetic disfigurement.

Most of these inherited ocular diseases are rare and a proper diagnosis requires expert physicians or multidisciplinary consultations and high-end specialized equipment for a differential diagnosis. However, high level medical infrastructure is expensive and not available in many parts of the world, leaving many patients without appropriate diagnosis and thus precludes proper counseling or knowledge of potential treatment options. Dropping prices for genome-wide genetic analysis such as exome sequencing may offer an economic alternative as a primary diagnostic measure in such scenarios.

Here we present the results of a genetic study on an unselected highly diverse spectrum of rare ophthalmic disorders in consanguineous Arab families from the Palestinian Authority and mainland Israel. Our results demonstrate the high sensitivity of current DNA-based diagnostics for ophthalmic disorders in this population even in families with poor prior clinical workup.

Materials and methods

Subjects

Study participants were recruited in Israel and the Palestinian Authority. In most instances, the patients and family members were recruited based on their self-reported symptoms and visual impairments upon visits of a non-ophthalmologist at the families’ home. Peripheral venous blood was taken from all consenting participants and used to isolate total genomic DNA applying standard procedures according to the manufacturer’s protocol (FlexiGene DNA Kit, QIAGEN, Hilden, Germany). DNA samples were sent to the University of Tuebingen (Germany) for genetic investigation. This study was performed according to the tenets of the Declaration of Helsinki and written informed consent was obtained from each study participant or from the parents in case of minor study subjects.

SNP genotyping and homozygosity mapping

SNP genotyping was done applying Affymetrix 250K NspI or CytoScan HD arrays (Affymetrix, Inc., Santa Clara, CA, USA). SNP chip genotypes were called with the Affymetrix Genotyping Console (GTC) Software v. 2.1 and used to determine homozygous regions by applying the online version of the HomozygosityMapper Software [7]. The threshold was set to a minimum of 2 Mb in length.

Exome sequencing

One affected patient per family underwent whole exome sequencing (WES). Exonic regions were enriched using the Agilent’s SureSelect XT Human All Exon v5 system (and in one case by using the SeqCap EZ Human Exome Library v2.0) according to the manufacturer's protocol (Illumina, San Diego, CA, USA). Sequencing was performed on Illumina HiSeq 2500 or NextSeq 500 instruments, and in one case on an Illumina Genome Analyzer IIx (GAIIx) instrument, in paired-end mode. WES was performed in 14 families (AQ10, AQ20, AQ21, AQ23, AQ24, AQ33, AQ34, AQ46, AQ48, AQ52, AQ53, AQ55, TR16, and MA08). For the (distantly) related families AQ33 and AQ34, WES was performed on one affected patient per family. In two families (AQ09 and TR21), variant detection was done by targeted NGS of >4800 genes applying the Illumina TruSight One Sequencing Panel (“clinical exome”) on Illumina MiSeq and HiSeq instruments. Raw NGS data were processed through an in-house pipeline (megSAP). Alignment was performed with BWA-MEM (https://arxiv.org/abs/1303.3997), freebayes (https://arxiv.org/abs/1207.3907) was used for variant calling, and SnpEff [8] was used for variant annotation. Variants were filtered for changes that are rare (minor allele frequency <1% due to The Genome Aggregation Database (gnomAD)), in homozygous state and in genes known/suggested to be associated with human diseases (“TruSight One Sequencing Panel filter”; 4813 genes).

Variant interpretation, Sanger sequencing, and segregation analysis

Variants identified by NGS were validated by PCR and subsequent Sanger sequencing using BigDye Terminator v. 1.1 chemistry and separation of sequencing products on a 3130xl Genetic Analyzer (Applied Biosystems, Weiterstadt, Germany). Primer sequences are listed in Table S1. Segregation analysis was done by Sanger sequencing with samples from all available family members. Minor allele frequency (MAF) data were obtained from The Genome Aggregation Database (gnomAD) (http://gnomad.broadinstitute.org/; accessed November 10, 2017). Variants were evaluated by the following in silico tools: MutationTaster (https://www.mutationtaster.org/) [9], PolyPhen-2 (https://genetics.bwh.harvard.edu/pph2/) [10], and SIFT (https://sift.jcvi.org/) [11], and classified in accordance with the standards and guidelines provided by the American College of Medical Genetics and Genomics (ACMG) and the Association for Molecular Pathology (AMP) [12] with the help of the Genetic Variant Interpretation Tool https://www.medschool.umaryland.edu/genetic_variant_interpretation_tool2.html/). Variants were submitted to ClinVar.

Results

The inclusion criteria for subjects and families were defined as follows: (1) bilateral ophthalmic disease or a disorder with predominant visual symptomology, (2) at least two affected subjects per family, and (3) likely autosomal recessive inheritance with parental consanguinity. The study finally comprised 20 independent Arab families from Israel and the Palestinian Authority. In two instances we merged independently recruited families (AQ11 and AQ12 as well as AQ33 and AQ34), because we could establish familial relationship and affected patients presented with very similar conditions or symptoms. All families met the inclusion criteria with the exception of AQ13. In this family, distantly related to family AQ11/12, there was only a single affected individual suffering from a complex phenotype with ocular and neurodevelopmental symptoms. Only few families had prior undergone a thorough clinical work up. In fact, the majority of families were recruited through personal contacts within their communities by visits at their home places in order to inform them about the study, to interview the patients and family members about their visual and other health complaints and to collect blood samples. Therefore, mostly very limited knowledge on the condition (i.e. 'night blindness and gradual vision loss' or 'decreased vision and sensitivity to light') was available prior to the genetic investigation. Information on the origin, clinical diagnosis, and genetic analysis are provided in Table 1 and all pedigrees and genotypes are shown in Fig. 1.

Table 1 Overview of the cohort.
Fig. 1: Pedigrees of all studied families.
figure 1

Black arrows indicate index patients; gray arrowheads indicate genotype–phenotype discrepancies. The families AQ33 and AQ34 were distantly related and segregate the same mutation and therefore merged to a single family (AQ33/34). Numbering (no. 1−20) of the families corresponds to the numbering shown in Table 1.

Applying homozygosity mapping and subsequent candidate gene analysis or — during the later stage of this project — WES of a single affected patient per family, we identified the most likely disease-causing gene variants in 19 of the 20 (95%) independent families (Table 2). One family (AQ13) remained genetically unsolved.

Table 2 Disease-causing variants: evaluation and classification.

Most likely disease-causing variants were observed in 15 different already known disease-associated genes (Table 2). This includes several ultrarare disease genes such as PXDN in ANTERIOR SEGMENT DYSGENESIS 7 (ASGD7; OMIM 269400), VSX2 in MICROPHTHAMLIA (OMIM 610093), or SNX10 in autosomal recessive OSTEOPETROSIS (OMIM 615085) for which very few affected families have been reported hitherto. Moreover, in family TR16 presenting with foveal hypoplasia and infantile nystagmus, we identified a nonsense variant in a novel disease candidate gene (AHR) as reported in detail elsewhere [13]. Of 19 different disease-associated variants (one of which representing a complex allele) identified in the present study, 10 were novel at the time of identification (Tables 1 and 2). However, the variant (c.2230C > T;p.(Arg744Ter)) introducing a premature stop codon in CRB1 was meanwhile reported [14]. Disease-associated variants comprised seven different missense changes, five different variants introducing a premature stop codon, five different indel variants (small insertions and/or deletions; four of them causing a frameshift), and one large deletion encompassing five consecutive exons of the SPATA7 gene as well as upstream sequences. The Stargardt disease patients of one family (TR21) were shown to carry a complex ABCA4 allele (c.[5512C > G;c.5882G > A]). All disease-associated variants were present in homozygous state in affecteds.

Surprisingly, we found that deleterious variants in two collagen genes, COL9A2 and COL18A1, which are associated with Stickler and Knobloch syndrome, respectively, are segregating in family AQ48. COL9A2-linked Stickler syndrome and COL18A1-linked Knobloch syndrome are two ultrarare disorders with overlapping ophthalmic phenotype which—to the best of our knowledge—have never been described together in a single patient in the scientific literature before.

Discussion

In 19 of 20 consanguineous Arab families from Israel and the Palestinian Authority with diverse ophthalmic conditions and mostly no or only rudimentary clinical data, we identified disease-causing variants in 16 different genes, including a novel candidate gene for foveal hypoplasia and infantile nystagmus. The prevalence of hereditary recessive diseases in Arab societies is high, mainly because of a long tradition of consanguinity due to socio-cultural factors [15]. Consanguineous marriages in the Israeli Arab society were shown to account for up to 40% of all marriages, and most frequently occurred among first cousins [16, 17]. Therefore, complex relationships within individual families and affected family members in multiple generations are often observed. Although consanguinity and the application of homozygosity mapping, or filtering for homozygous variants in exome sequencing data, likely facilitated the identification of the disease-causing variants, the high diagnostic yield (95%) was remarkable given the heterogeneity of conditions and the mostly poor or absent prior clinical examination. Recruitment of the families was carried out over a period of 6 years, mostly by visits of a non-ophthalmologist at the families’ home (field study), and the patients and family members were enrolled according to their self-reported visual problems. Therefore we argue that a concept of 'reverse diagnostics' by WES-based genetic testing as a first tier approach can be highly efficient, and further represent a model for countries or people with limited access to specialized ophthalmic clinical services.

Genotyping of available family members by PCR and subsequent Sanger sequencing confirmed concordant segregation in all but three families (AQ06, AQ10, and AQ20; Fig. 1). False disease status information for some family members may be the most likely explanation, as well as accidental mislabeling or mix-up of samples during the field study. Other, less likely possibilities includes reduced penetrance or non-causality of the identified variants, either in the entire family or in individual subjects.

Five of the 12 distinct disease-associated variants in nine different genes associated with retinal dystrophies (isolated and syndromic) were novel. To of them — a nonsense variant in FAM161A in family AQ08 and a large deletion in SPATA7 in family TR11 — have already been described in detail elsewhere [18, 19].

In family TR21, three siblings had clinically been diagnosed with Stargardt disease, which was genetically confirmed by the identification of a homozygous ABCA4 variant. The ABCA4 complex allele (c.[5512C > G;5882G > A]) seen in this family has previously been observed in two siblings of a consanguineous Jordanian family, reported to have a severe, early-onset phenotype of Stargardt disease with severe macular atrophy and additional peripheral bone spicule-like pigmentary deposits as seen in retinitis pigmentosa [20].

Retinal dystrophy in four families (AQ33/AQ34, AQ24, AQ10, and AQ55) was CRB1-related. This is in line with a study by Beryozkin et al., showing that CRB1 variants are a relatively frequent cause of autosomal recessive early-onset retinal degeneration in the Israeli and Palestinian populations [21]. Furthermore, the only recurrent disease-associated variant in this study, CRB1: c.3307G > A; p.(Gly1103Arg), observed in two unrelated families (AQ10 and AQ55), was demonstrated to be a true founder variant in this population [21]. Disease-associated variants in CRB1 imply that the affected subjects in our families most likely suffer from severe retinitis pigmentosa (RP12; ref. [22]) or Leber congenital amaurosis [23], which is well compatible with the symptomology described by the families. However, a definite diagnosis would require a detailed clinical workup by an ophthalmologist in these cases.

Two families (AQ11/12 and AQ23) were shown to carry known disease-causing variants in CNGA3, a gene associated with achromatopsia or cone dystrophy [24]. Notably, clinical trials to evaluate safety and efficacy of gene supplementation therapy for CNGA3-linked achromatopsia are currently underway including one multi-site trial, NCT02935517, with a study site in Israel. This further emphasizes the value of this study and testing strategy to inform patients about clinical trials (usually at no costs for the patient) and identify patients with rare conditions as potential participants.

Family AQ11/12 has a further family branch (AQ13) with a single affected individual (AQ13-05). This subject had different symptoms (cerebellar atrophy with hearing loss and decreased vision) compared to his affected cousins in family branches AQ11 and AQ12, and did not carry the CNGA3 variant. AQ13-05 remained the only unsolved patient of this study although a separate WES analysis was done.

A novel nonsense variant in GPR98 (also designated as ADGRV1) was identified in a family with four children suffering from both visual and hearing impairment (AQ09). Variants in GPR98 account for a small but significant proportion of subjects suffering from Usher syndrome type 2 (6.4%). Most of these GPR98 variants, as in our case, result in a truncated protein product [25]. Of note, three of four affected siblings in AQ09 were shown to be heterozygous carriers of an additional complex USH2A allele (c.[3959C > T;12052G > A]), which was inherited from the mother). Both missense USH2A variants have already been reported in Arab-Muslim families in the Jerusalem region [26].

In family AQ20 with multiple affected patients reporting a defect of vision and night blindness, a novel frameshift variant (c.79delA; p.(Thr27Profs*26)) was identified in RLBP1 encoding the cellular retinaldehyde-binding protein (CRALBP). Variants in this gene have been reported to cause autosomal recessive retinitis pigmentosa [27] or retinitis punctata albescens [28], a subtype of retinitis pigmentosa characterized by night blindness, delayed dark adaptation, and the aggregation of small yellow-white dots in the posterior pole.

Another gene involved in the intraretinal regeneration of 11-cis-retinal, RDH5, was mutated in family AQ06. The observed missense variant (c.382G > A; p.(Asp128Asn)) has been reported in several prior studies [29,30,31] and was shown to strongly reduce protein stability in COS-1 cells [32]. Variants in RDH5 are associated with autosomal recessive Fundus albipunctatus (FA), characterized by night blindness and subretinal yellow-white spots [33]. In FA, night blindness is usually stationary and visual acuity is often normal which may compromise the ascertainment of the clinical status in a field study and explain non-concordant genotypes in this family.

Disease-associated variants in RPGRIP1 were first described in 2001 in a study involving 57 unrelated patients with Leber congenital amaurosis [34], and were also reported to cause a clinically somewhat milder cone–rod dystrophy [35]. In family AQ53, we observed a known frameshift variant (c.1107delA; p.(Glu370Asnfs*5)) (ref. [34]). The reported early disease onset in the affected family members is in accordance with a Leber congenital amaurosis-type of retinal dystrophy in family AQ53.

Affected family members of family AQ46 suffered from osteopetrosis with complete vision loss and were shown to carry a homozygous missense variant in the SNX10 gene. Variants in SNX10 are reported to cause autosomal recessive osteopetrosis type 8. Complete visual loss occurs invariably in all untreated patients [36]. The disease is frequently fatal during infancy unless treated by hematopoietic stem cell transplantation [37]. The SNX10 variant (c.284G > A; p.(Arg95His)) observed in this study has already been described in homozygous state in a female patient from consanguineous parents with autism spectrum disorder, developmental delay, and osteopetrosis [38].

Our study included two families with affected subjects presenting with distinct ocular malformations, who were found to carry homozygous missense variants in VSX2 and PXDN, respectively. The VSX2 variant c.679C > T; p.(Arg227Trp) has previously been reported in a large, consanguineous Arab family with four affected patients suffering from isolated microphthalmia/anophthalmia [39], consistent with the findings (atrophic eye balls, complete vision loss since birth) in family AQ52. Notably, VSX2-associated ocular malformations are almost exclusively reported from the Middle-East in families with parental consanguinity [40]. The novel PXDN missense variant (c.2459A > G; p.(Gln820Arg)) identified in family MA08 with congenital nanophthalmos and corneal opacification affects a highly conserved amino acid as shown in Fig. 2. It is also predicted to be disease-causing by MutationTaster, SIFT, and PolyPhen-2, and further absent from the gnomAD data set (Table 2). Prior reports describe similar ocular anomalies (cataract-microcornea with corneal opacity, microphthalmia, and anterior segment dysgenesis) due to variants in PXDN [41, 42].

Fig. 2
figure 2

Amino acid conservation of protein sequences affected by novel missense variants.

Stickler syndrome type V and Knobloch syndrome type I are caused by variants in the two different collagen genes COL9A2 and COL18A1, respectively. Besides a single 5-generation consanguineous family of Asian-Indian origin segregating autosomal recessive Stickler syndrome that has been identified to carry a biallelic 8-bp deletion in the COL9A2 gene leading to a premature termination codon (PTC) [43], only three more patients in two unrelated families have been shown to harbor biallelic disease-associated variants in this gene [44]. In the present study, we now identified a novel missense variant c.1774G > A; p.(Gly592Ser) affecting a highly conserved amino acid in the collagen type IX alpha 2 chain (Fig. 2) in family AQ48, likely representing the fourth case with Stickler syndrome type V due to a biallelic COL9A2 variant. The variant was predicted to be deleterious by all three in silico tools (MutationTaster, SIFT, and PolyPhen-2) and observed with a minor allele frequency of 8.802e−6 in gnomAD (Table 2). The mother (AQ48-01-02) was found to be homozygous for the COL9A2 variant and upon re-interview reported to suffer from severe hearing impairment since birth, requiring assistive devices. Note that neurosensory hearing loss is a consistent feature in Stickler syndrome type V. Family AQ48 additionally segregates a known 2-bp deletion (c.3514_3515delCT; p.(Leu1172Valfs*72)) in COL18A1 repeatedly reported in subjects with autosomal recessive Knobloch syndrome [45,46,47]. The index patient in AQ48 was homozygous for the COL9A2 as well as the 2-bp deletion in COL18A1, which is unique, as—to the best of our knowledge—Knobloch and Stickler syndrome have never been observed together in a single individual before. Accordingly, her phenotype was reported to be slightly more severe compared to other affected family members that were homozygous for either the COL9A2 or the COL18A1 variant, or were triallelic.

Defects in GALK1 were shown to be responsible for autosomal recessive congenital cataracts [48]. Galactokinase (GALK1) is an enzyme involved in the first step of galactose metabolism. Cataract formation in galactokinase-deficient individuals is the result of osmotic phenomena caused by the accumulation of galactitol, the reduction product of galactose through an alternative pathway, in the lens [49]. In family AQ21 with congenital cataract and a novel 1-bp deletion (c.678delT; p.(Asn226Lysfs*38)) in GALK1, extra-ocular symptoms (speech delay, hearing loss, late walking, huge tongue, and hypothyroidism) have been noted in the index patient. Severe speech delay was also reported in two galactokinase-deficient brothers further showing mental retardation and cataract [50]. In classic galactosemia (linked to defects in GALT) it has been shown that speech, motor coordination, and strength disorders, likely associated with diffuse cerebellar damage, co-occur [51]. Hypothyroidism can be congenital and accompanied by sensorineural hearing loss [52] and a large protruding tongue [53]. This triad of symptoms is unlikely to be caused by the 1-bp deletion in GALK1 given their absence in the affected sister.

In family TR16, a novel disease gene for autosomal recessive infantile nystagmus was identified as described in detail elsewhere [13]. The three affected family members were shown to be homozygous for a nonsense variant (c.1861C > T; p.(Gln621Ter)) in the aryl hydrocarbon receptor (AHR) gene encoding a ligand activated transcription factor involved in various physiological and developmental processes including neurogenesis. A knockout mouse model showing similar disease expression supports our hypothesis [54, 55]. This finding represents a novel disease entity characterized by foveal hypoplasia and infantile nystagmus.

In this study, we demonstrated that the genetic causes of highly heterogeneous ophthalmic disorders can be identified with high efficacy in consanguineous Arab families, even with no or sparse initial clinical data. Especially, exome sequencing could serve as an economic diagnostic approach, enabling efficient 'reverse diagnostics' and the opportunity for tailored care, rehabilitation, and treatment.