Genetic variability studies provide basic information regarding the genetic properties of the population based on which breeding methods are formulated for improving the targeted characters. The analysis of variance for both years indicated highly significant differences (p<0.001) among the genotypes for all the evaluated traits, indicating the presence of substantial genetic variability in the experimental material that can be further utilized for pigeon pea improvement programme. Variances for both years when compared, revealed non-significance differences and therefore, the data of both years can be combined for further analysis.
Association of different quantitative characters
Correlation analysis was done to find out interrelations among quantitative traits including yield and its component traits. Notably, significant Pearson’s correlation coefficients were observed for nearly all the traits, with variations noted when different origins were considered, contributing to the overall correlation. Traits such as DFF, DM, PH and HSW demonstrated significant and positive correlations with SYP (Fig 1). These distinguished traits exhibited considerable promise in the selection process of high-yielding genotypes within the pigeonpea species from the primary gene pool. These findings corroborate with the earlier studies conducted by
Verma et al., (2018a), Gaur et al., (2020) and
Ranjani et al., (2018), particularly regarding the attributes of DFF and DM. Among the component traits, PH showed significant correlations with six traits, namely NPB, NSP, DFF, DM, HSW and SYP followed by DM and SYP with four significant correlations each. These results highlight the crucial roles played by PH, DM and SYP in influencing yield in the present study, emphasizing the importance of prioritizing these traits in hybridization programs aimed at improving crop productivity.
It can be observed that the genotypes belonging to the north zone were the main determinant for the correlation to be significant between most of the traits. A highly significant positive correlation (p<0.001) was documented between DFF and DM (0.587***), followed by a correlation between DFF and SYP (0.323***). These results validated the earlier findings of
Pal et al., (2018); Vanniarajan et al., (2021) and
Reddy et al., (2023). Furthermore, a significant negative correlation was observed between NPB and DM (-0.095**), as well as between PH and HSW (-0.079*). Consequently, these traits hold potential for effective utilization either independently or in tandem, to augment the yield potential of this crop.
Multivariate analysis - Principal component analysis
Principal component analysis (PCA) is a multivariate statistical method for analyzing and simplifying complex and large datasets. Based on the association between studied characteristics and extracted clusters, the variation patterns in pigeon pea genotypes were investigated using PCA to determine the genotypes’ genetic diversity and relationship with the studied traits. PCA was conducted using data from ten quantitative traits and the outcomes are summarized in Fig 5. Hierarchical clustering on principal components (HCPC) was employed to elucidate the existing variability within the collection and to explore the similarities and differences between individuals based on the ten quantitative traits. HCPC, being a multivariate method, integrates PCA and clustering techniques to delineate stable morphological groups (Fig 4). Conse- quently, a PCA was conducted based on the ten quantitative traits and the first four principal components, each possessing eigenvalues greater than 1.0, collectively elucidated approximately 62.37% of the overall variation, as depicted in Fig 2. The PC-1 attributing to the most variability (18.8%), displayed notable positive Rotational component (RC) loadings with DFF (0.919) and DM (0.905). Consequently, these traits exerted considerable influence on the observed divergence and contributed substantially to the variability, aligning with previous findings (Fig 3).
The PC-2 accounted for 16.8% of the total variation. It was characterized by traits such as NPB, NSB, NPP and PH. PC-3 elucidated 15.1% of the variation and was associated with traits like HSW and SYP. PC-4 explained 11.7% of the total variation and featured traits like PL and NSP. PCA was utilized by
Manyasa et al., (2008); Zavinon et al. (2019) and
Upadhyaya et al., (2007) to find out the relative importance of different traits in the collections they analysed. Similar findings were reported by them in their study.
The clustering analysis performed on the principal components
i.
e. hierarchical clustering on principal components (HCPC) led to the classification of the 155 pigeon pea accessions into three primary clusters (Fig 4).
The individuals projected onto the axis system, defined by the first two principal components, displayed a mixed distribution among the genotypes. Notably, genotypes originating from various groups were associated with all three clusters, as illustrated on the individual factor map (Fig 4). This clustering pattern underscores a significant degree of variability within the different pigeon pea genotypes. Furthermore, the comparative analysis of phenotypic mean values revealed noteworthy variations among clusters for all quantitative variables, except for PL and NSP.
The principal component biplot, illustrating the quantitative traits among the pigeon pea genotypes, is presented in Fig 5. Two variables showed lower magnitude with shorter vector lengths,
i.
e., HSW and PL, whereas DFF and DM, with longer vector lengths, showed a higher magnitude (more variance) than the rest of the traits. In a biplot diagram, vector angles are key: the cosine of the angle approximates the correlation coefficient between two characters
(Yan and Kang, 2019). A <90
o angle indicates a positive correlation, >90
o indicates a negative correlation and 90
o signifies independence
(Yan and Rajcan, 2002). Four trait vectors
i.
e. NPB, NSB, PH and PL had small angles between them, indicating positive correlations. Similarly, DFF and DM also had a small angle between their vectors, signifying a positive correlation. However, DFF and DM formed angles greater than 90o with NPB, NSB, PH and PL, indicating negative correlations. Thus, smaller angles between vectors indicate stronger positive correlations among traits and larger angles indicate negative correlations. Notably, genotypes 132 (PA 676), 75 (PA 471), 151 (RVSA 2014-2), 74 (PA 470), 101 (PA 500), 84 (PA 483), 83 (PA 482), 22 (ICPL 8501s0) and 126 (PA 652)occupied distinct positions, situated far from the remaining genotypes in the biplot indicated their potential for further use in breeding programs (Fig 5). According to the principal component analysis, traits such as DFF and DM in PC-1 and traits such as the NPB, NSB, NPP and PH in PC-2, accounted for a significant portion of the variation.
Genotypes in the top right quadrant were closely related to PH, PL, NSB, NSP, NPP, HSW and SYP traits. The top left quadrant contained varieties related to the NPB trait. The bottom right quadrant included varieties associated with DFF and DM traits. Overall, the biplot analysis effectively visualized the relationships among genotypes and yield contributing traits.
Diversity assessment through D2 analysis and complete hierarchical clustering
The Mahalonobis D
2 values have classified the 155 pigeon pea genotypes into nine discrete clusters. Analysis of the average intra- and inter-cluster distances among these clusters (Table 1) indicated minimal genetic variation among genotypes within the same cluster across ten quantitative traits. The maximum intra-cluster distance of 29.53 was observed for cluster VIII. Conversely, significant genetic diversity exists between genotypes from different clusters. The maximum inter-cluster distance was obser-ved between clusters VI and VII (103.98), followed by clusters VI and VIII (91.13). Optimal genetic divergence is preferred between parental lines for hybridization to increase the likelihood of producing favourable segre-gants. Therefore, hybridization between genotypes from clusters with maximum inter-cluster distances is recomm-ended to enhance the probability of isolating desirable segregants in subsequent generations. The substantial inter-cluster distance in comparison to the intra-cluster distance indicated the presence of a considerable amount of genetic diversity among the studied genotypes. The present observation is in alignment with the earlier findings of
Verma et al., (2018b), Naing et al., (2022) and
Pushpavalli et al., (2017), who also noted a high magnitude of inter-cluster distance relative to intra-cluster distance. Clusters VI and VII, as well as clusters VI and VIII, demonstrated considerable divergence, suggesting that hybridization programs involving genotypes from these clusters are more likely to yield superior segregants or desirable combinations for the development of valuable genetic resources or varieties.
Analysis of genotype mean values across clusters (Table 2) highlights notable differences. Cluster VII shows the shortest maturity duration (137.37 days) and days to 50% flowering (75.53 days), making it ideal for developing short-duration varieties. In contrast, Cluster VI has the longest maturity duration (158.57 days) and days to 50% flowering (100.20 days). Cluster VI also boasts the highest number of primary branches (6.00) and pods per plant (169.30). Cluster VIII excels in secondary branches (14.75) and hundred-seed weight (12.18 g), while Cluster V has the tallest plants (246.59 cm). Pod length and the number of seeds per pod are highest in Cluster VI (5.58 cm and 4.47 seeds). Cluster IX exhibits the highest seed yield per plant (47.88 g). These findings suggest that genotypes of Clusters VI, VII and VIII are valuable for hybridization programs aiming to develop heterotic combinations, higher yield and short-duration varieties.
Analysis of the D
2 statistic revealed the traits contributing to genetic divergence (Table 2). The greatest contribution to genetic divergence comes from days to 50% flowering (27.92%), followed by hundred-seed weight (15.25%), seed yield per plant (10.77%) and plant height (8.27%). These traits collectively account for over 62% of the total divergence observed in the pigeon pea genotypes.Among the genotypes used in the present study, it was observed that cluster I contained the highest number of genotypes (87) followed by cluster II (21). The lowest number of genotypes were grouped in cluster IX containing only one genotype (Table 2).
A dendrogram was also constructed using Euclidean distance and the “ward” method of the hierarchical clustering of 155 genotypes into 6 distinct clusters. Ward method of clustering was the preferred method of choice since it had the highest correlation between cophenetic distance andthe original distance (0.34) when different methods of clustering were compared. The highest number of genotypes was in cluster III (43), followed by 36 genotypes in cluster II and the lowest number of genotypes was grouped in cluster V (16 genotypes) (Fig 6). The genotypes were categorized into six distinct clusters, providing an opportunity to identify divergent parental lines for hybridization and the establishment of novel pigeon pea breeding populations. There exists a necessity to introduce new genetic variations or incorporate genes from genetically divergent parents to capitalize on the genetic diversity observed within the evaluated pigeon pea genotypes.
The clustering of 155 genotypes was conducted using two distinct methods: clustering based on Mahalanobis D
2 values utilizing the modified Tocher’s method and hierarchical clustering employing the wardD method. In this study, both methodologies were compared, revealing an inability to consistently group genotypes of similar origins into the same clusters. While certain genotypes from the same origin were occasionally grouped together in both methods but the frequency of such occurrences was not substantial enough to draw a definite conclusion.
This suggested that further investigation is needed and consequently, it is inferred that the grouping of genotypes into distinct clusters primarily hinges on their genetic diversity rather than their geographical origins.