Clustering of genotypes
Members within the same cluster are expected to be in a high close relationship in terms of the characteristics being studied than members from different clusters. D
2 clustering pattern deciphered that a total of 75 genotypes were grouped into 16 distinct clusters, displayed in Table 2 and Fig 1. Cluster II exhibited the greatest number of genotypes, consisting of 38, while cluster IV recorded the second maximum number of genotypes with 17. Cluster XI composed of five genotypes follows Cluster I, which contains three genotypes. The lowest number of genotypes was found in clusters III, V, VI, VII, VIII, IX, X, XII, XIII, XIV, XV and XVI, which had a single genotype each. The exotic germplasms, Hermes, Redwood 65 and AR-2 were classified into clusters, II, XVI and X, respectively. Similar patterns of clustering were corroborated by
Kant et al., (2011); Meena et al., (2021) and
Kumar and Kumar (2021). The intra-cluster and inter-cluster distance estimations of D
2 values were presented in Table 3. Cluster IV (14.34) has recorded the highest intra-cluster distance, followed by cluster II (13.20), cluster XI (12.78) and cluster I (6.31) in descending order. Only one genotype was present in clusters X, XI, VIII, VII, IV and III (0.00), which exhibited the lowest intra-cluster D value. The highest intra-cluster distance was recorded in descending order between cluster XI and cluster I (40.62), cluster V and XI (38.13), cluster V and cluster XIV (35.57), cluster I and cluster XIV (34.73) and cluster VI and cluster XV (33.98). Due to the greater inter-cluster distances among these clusters, crossing between these clusters can result in hybrids with better heterosis (
Acquaah, 2012). Similar types of results in diversity studies in linseed have been previously confirmed by
(Tewari et al., 2020; Nizar and Mulani, 2015;
Pali and Mehta, 2015). The lowest inter-cluster distance was recorded among clusters VI and X (8.79), then between VII and XII (9.99), III and VII (10.88), III and VIII (11.72) and cluster III and cluster XIV (11.73).
Cluster means (Average)
There was a considerable variation between 16 clusters concerning cluster average for distinct traits, as revealed by intra-cluster means ten characters shown in Table 4. Clusters XIV (89.00 days), cluster XIII (83.00 days), cluster X (82.00 days) and cluster VII (81.00 days) were recorded the highest average and clusters IX (64.00 days) and XV (65.00) had the lowest average for days for 50 percent blooming. The greater cluster mean for the primary branch was obtained for cluster X (7.67) followed by cluster IX (7.33), cluster VI (5.33) and cluster XIV (5.00). The lowest cluster mean was observed for cluster V (2.67). Clusters X (31), IX (30.33), XVI (29.00) and VI (20.67) showed the highest cluster average value in descending order for the secondary branch and cluster I (8.67) recorded the lowest average. For plant height, clusters I (104.00), XVI (102.67), V (79.00) and VI (75.33) were observed the highest cluster average and cluster XV (49.00) had the lowest average. The highest cluster average was observed in clusters VI (172.00), XI (165.80), X (156.33) and XVI (132.33) and the lowest cluster mean was in cluster XV (36.00) for capsules per plant. Cluster V (7.67) exhibited the lowest average and clusters VIII (9.33), III (9.00), XIV (8.59) and XI (8.93) for trait, seeds per capsule. When it comes to days taken for maturity, clusters XIV (146.33), XII (145.00), X (141.00) and I (140.67) demonstrated the highest cluster average and cluster VIII (126.00) indicated the lowest mean. Clusters IX (45.50), V (42.00), IV (39.24) and I (38.67) recorded the highest average oil content. In contrast, Cluster XIV (24.86) recorded the lowest mean. The maximum average of 1000-seed weight was observed in clusters VI (8.84), X (8.71), XII (8.52) and XIV (8.26). On the other hand, cluster XIV (4.18) had the lowest average weight. The character seed yield recorded the maximum average in cluster XIII (12.47) then in cluster X (10.12), cluster VI (10.09), cluster IX (9.37) and cluster XI (9.35). The clusters V and XIV (2.82) were noticed the lowest average.
Ranjana et al., (2019) and
Meena et al., (2021) have obtained similar results.
The contribution of different traits towards the divergence
According to average D
2, the highest contribution (presented in Table 5) was from the capsule number (35.75%) followed by plant height (19.96%), oil content (18.70%), seed yield (9.48%), test weight (8.72%). The other characteristics
viz., days needed for 50 per cent blooming, maturity duration, primary branch, secondary branch and seeds per capsule recorded negligible contribution. These characteristics, like the capsules per plant, plant height, grain yield and oil percentage should be emphasized to choose the appropriate parents for crossing.
Tewari et al., (2013) also followed the high contribution of capsule number and seed yield toward total genetic divergence.
Principal component analysis (PCA)
A multivariate approach (
Crossa, 1990) like Principal component analysis (PCA) is used to determine how the different traits contribute to overall variability and to offer a basis for choosing characteristics. The primary goal of PCA is to compress the total variation from studied variables into a smaller set of factors (
Sharma, 1998;
Brejda et al., 2000). In this study, a total of ten principal components (PCs) were extracted, equivalent to the number of traits studied and it revealed the four most informative PCs with eigenvalues of more than one which accounted for 68.38 per cent cumulative variance (Table 6). However, more than 50 per cent of the variance in the population was explained by three major PCs (PC1 25.88%; PC2-18.58% and PC3-13.12%).
Dabalo et al., (2020); Guei et al., (2005) confirmed these findings. As a result, parameters showed positive weight in the first three PCs considered to be more crucial
(Patial et al., 2019). The secondary branch number of each plant (0.54), primary branch number (0.49), capsule number (0.47), seed yield (0.45), days needed for 50 percent blooming (0.13) and maturity duration (0.07) recorded positive weightage in PC1 while other traits were negative weight (Fig 2B). In PC2, the parameters
viz., duration of 50 per cent blooming (0.32), seeds per fruit (0.27) and capsules (0.16) showed positive loading while other traits showed negative loading. In PC3, plant height (0.71), days needed for 50 per cent blooming (0.61) and maturity duration (0.31) showed highest positive loading and the remaining parameters showed negative loading. Generally, only a single component was chosen from these determined classes. The secondary branch of each plant was the better choice, as it exhibited the highest loading from PC1. Similarly, the time taken for 50 per cent blooming, plant length and seed number were best choices for the second, third and fourth principal components, respectively. PCA clearly demonstrated that the secondary branch of a plant, days to 50 per cent flowering, plant height and seeds per capsule were the most important traits showing a strong effect on total variation.
The scree plot illustrated the percentage difference between principal components and eigenvalues (Fig 2A). PC1 showed 25.88 percent variability with an Eigenvalue of 2.588 in this study. The first component (PC1) recorded the highest variance compared to other PCs (Fig 2A). Due to greater variance explained by the first component, genotypes selected from this group might be helpful in breeding programs for trait improvement. The selection of clusters from PC1 is highly beneficial in trait enhancement approaches, as it exhibits the greater variability (Fig 3). The biplot diagrams represent how the trait interacts and which genotypes perform better towards traits (Fig 2B).