The following article is Open access

Planets Across Space and Time (PAST). III. Morphology of the Planetary Radius Valley as a Function of Stellar Age and Metallicity in the Galactic Context Revealed by the LAMOST-Gaia-Kepler Sample

, , , , , , , , , and

Published 2022 May 5 © 2022. The Author(s). Published by the American Astronomical Society.
, , Citation Di-Chang Chen et al 2022 AJ 163 249 DOI 10.3847/1538-3881/ac641f

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

1538-3881/163/6/249

Abstract

The radius valley, a dip in the radius distribution of exoplanets at ∼1.9 R, separates compact rocky super-Earths and sub-Neptunes with lower density. Various hypotheses have been put forward to explain the radius valley. Characterizing the radius valley morphology and its correlation to stellar properties will provide crucial observation constraints on its origin mechanism and deepen the understanding of planet formation and evolution. In this paper, the third part of the Planets Across Space and Time series, using the LAMOST-Gaia-Kepler catalog, we perform a systematical investigation into how the radius valley morphology varies in the Galactic context, i.e., thin/thick galactic disks, stellar age, and metallicity abundance ([Fe/H] and [α/Fe]). We find the following: (1) The valley becomes more prominent with the increase of both age and [Fe/H]. (2) The number ratio of super-Earths to sub-Neptunes monotonically increases with age but decreases with [Fe/H] and [α/Fe]. (3) The average radius of planets above the valley (2.1–6 R) decreases with age but increases with [Fe/H]. (4) In contrast, the average radius of planets below the valley (R < 1.7 R) is broadly independent of age and metallicity. Our results demonstrate that the valley morphology, as well as the whole planetary radius distribution, evolves on a long timescale of gigayears, and metallicities (not only Fe but also other metal elements, e.g., Mg, Si, Ca, Ti) play important roles in planet formation and in the long-term planetary evolution.

Export citation and abstract BibTeX RIS

Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

During the past quarter century, over 4000 planets have been identified, and thousands of candidates are yet to be confirmed (NASA Exoplanet Archive, EA hereafter; Akeson et al. 2013). A large sample of known exoplanets have opened doors to exoplanet statistical studies. One of the crucial questions is how planet properties depend on their host star properties, which provides important insights on understanding planet formation and evolution (Mulders 2018; Zhu & Dong 2021). For example, it has been well established that the occurrence rate of giant planets is strongly correlated to the stellar metallicity (Fischer & Valenti 2005; Wang & Fischer 2015), providing key evidence to the core accretion model on giant planet formation (e.g., Lissauer 1993; Pollack et al. 1996; Ida & Lin 2004a, 2004b). Furthermore, smaller planets of short orbital periods (e.g., hot Neptunes and super-Earths) are also found to be preferentially around metal-rich stars (Wang & Fischer 2015; Dong et al. 2018; Petigura et al. 2018), suggesting that metallicity plays an important role in the planet formation and orbital migration.

Recently, with the large planet sample provided by Kepler, statistical studies have revealed another important planetary feature, i.e., a radius valley (a paucity of planets around ∼1.9 R) that separates rocky, compact super-Earths and sub-Neptunes with lower bulk densities (Owen & Wu 2013; Fulton et al. 2017; Hardegree-Ullman et al. 2020). Until now, a number of theoretical models have been proposed to explain the radius valley. These models can be generally divided into two categories: evolutionary models and primordial models. On the one hand, from the view of evolutionary models, the radius valley is a result of the evolution of planetary radius distribution due to the loss of planetary atmosphere after planet formation. The energy source that drives the atmosphere loss process could be either from outside, i.e., the high-energy (e.g., X-ray) radiation of the host star (the photoevaporation mechanism; Owen & Wu 2013; Jin et al. 2014; Lopez & Fortney 2016; Owen & Wu 2017; Jin & Mordasini 2018), or from inside, i.e., the cooling luminosity of the central planet core (the core-powered mass-loss mechanism; Ginzburg et al. 2016, 2018; Gupta & Schlichting 2019, 2020). On the other hand, from the view of primordial models, the radius valley is a natural result from planet formation and migration. Some studies suggested that the valley emerged because of the formation of two distinct planet populations with two different core compositions, i.e., super-Earths with rocky cores that probably formed in situ, and sub-Neptunes with water/ice-rich cores that probably formed beyond ice lines but migrated into current orbits (Zeng et al. 2019; Venturini et al. 2020). Alternatively, some other studies found that the valley may be recovered by planets formed in situ with the same core composition if the cores have a broad initial mass function and accrete gas in gas-poor (but not gas-empty) nebulae (Lee & Chiang 2016; Lee & Connors 2021).

To better understand the origin of the radius valley and further constrain the above theoretic models, one may rely on more observational clues from the dependence of the radius valley on planetary and stellar properties (e.g., Rogers et al. 2021). Here we briefly summarize recent progress in this aspect.

  • 1.  
    Planet period dependence. The valley center is found to be anticorrelated with orbital period of planets (Fulton et al. 2017; Van Eylen et al. 2018; Martinez et al. 2019).
  • 2.  
    Stellar mass dependence. The valley center is positively correlated with stellar mass (Fulton & Petigura 2018; Wu 2019; Berger et al. 2020a).
  • 3.  
    Stellar environment dependence. Kruijssen et al. (2020) found that planetary systems around field stars exhibit a dearth of planets below the radius valley compared to systems in star clusters with high phase space densities.
  • 4.  
    Stellar metallicity dependence. Owen & Murray-Clay (2018) found that sub-Neptunes are larger and the radius valley is wider around metal-richer stars.
  • 5.  
    Stellar age dependence. Both Berger et al. (2020a) and Sandoval et al. (2021) found that the number ratio of super-Earth to sub-Neptune rises around older stars.

In addition, David et al. (2021) reported that the radius valley is emptier around younger stars using the California-Kepler Survey (CKS) sample (Johnson et al. 2017; Petigura et al. 2017). However, such a feature is not seen by Berger et al. (2020a) using the Gaia-Kepler catalog. Two potential reasons could cause such inconsistent results. First, estimation of stellar age is generally difficult and suffers considerable uncertainty. Both Berger et al. (2020a) and Sandoval et al. (2021) used isochrone fitting to derive age, whose uncertainty is large for main-sequence stars, e.g., ∼56% in the Gaia-Kepler catalog (Berger et al. 2020b). Second, age is generally correlated to other stellar properties; thus, the valley dependence on other stellar properties (e.g., metallicity) could affect the observed (apparent) result of the age dependence.

In this paper, we investigate the radius valley morphology in the Galactic context, focusing on its dependence on Galactic components (e.g., thin/thick-disk stars), metallicity (e.g., [Fe/H] and [α/Fe]), and age. This is the third paper of the series of Planets Across Space and Time (PAST). In the first paper of PAST (hereafter PAST I; Chen et al. 2021a), we extended the applicable range of the kinematic method for classification of Galactic components from the solar neighborhood (∼100–200 pc) to ∼1500 pc to cover most of planet hosts, and we refined the age–velocity dispersion relation (AVR) to derive kinematic ages of planet hosts with a typical uncertainty of ∼10%–20%. Applying the methods of PAST I, we constructed a LAMOST-Gaia-Kepler stellar catalog in the second paper of PAST (hereafter PAST II; Chen et al. 2021b), which provides kinematic properties and other basic stellar properties of 35,835 Kepler stars, including hosts of 1061 Kepler planets (candidates). The LAMOST-Gaia-Kepler stellar catalog of PAST II allows us to reveal the effects of various stellar properties ([Fe/H], [α/Fe], and age) on the radius valley, respectively.

The rest of this paper is organized as follows. In Section 2, we describe the sample selection of stars and planets (Section 2.1) and define several metrics to characterize the radius gap morphology (Section 2.2). In Section 3, we present the results of how the radius gap morphology is affected by Galactic component (Section 3.1), kinematic age (Section 3.2), [Fe/H] (Section 3.3), and [α/Fe] (Section 3.4). In Section 4, we discuss our results and their implications for planet formation and evolution. Finally, we summarize the paper in Section 5.

2. Methods

2.1. Sample Selection: Stars and Planets

We initialized our stellar sample from the LAMOST-Gaia-Kepler catalog from PAST II (Chen et al. 2021b), a total of 35,835 Kepler stars, including stellar hosts of 1060 Kepler planets (candidates). In addition to some basic stellar parameters (e.g., mass, radius, effective temperature, [Fe/H], [α/Fe]), the catalog provides stellar kinematic properties, such as the Galactic velocities and the derived relative membership probabilities among different Galactic components (TD/D, TD/H, Herc/D, Herc/TD), where D, TD, Herc, and H denote the thin disk, the thick disk, the Hercules stream, and the halo of the Milky Way, respectively.

For the star sample considered in this paper, we first excluded giant stars by eliminating stars with ${\mathrm{log}}_{10}\tfrac{{R}_{* }}{{R}_{\odot }}\,\gt 0.00035\times ({T}_{\mathrm{eff}}-4500)+0.15$ according to Fulton et al. (2017). The stellar mass, radius, and Teff are adopted from the Gaia-Kepler stellar properties catalog of Berger et al. (2020b). Following Bensby et al. (2014), we then adopted the criteria "Herc/D < 0.5 & Hec/TD < 0.5 & TD/H > 1" to keep only stars in the Galactic disk because kinematic age only applies to stars belonging to the Galactic disk components as suggested in PAST I (Chen et al. 2021a). Finally, we removed stars without [Fe/H] or [α/Fe] measurements.

For the planet sample, we excluded planets of grazing transits (i.e., $\tfrac{{R}_{p}}{{R}_{* }}+b\gt 1$, where b is the transit impact parameter in the Kepler DR25 table from the NASA exoplanet archive (NASA Exoplanet Archive 2021) and R p is calculated by multiplying stellar radius and the radius ratio of planet to star) because their radius measurements generally suffer from large uncertainties. To ensure the detection efficiency of super-Earths, we only kept planets with orbital period less than 100 days. We also excluded ultra–short-period planets (USPs, planets with orbital period less than 1 day) because they are relatively rare and likely to be formed differently as compared to the bulk of Kepler planets (Winn et al. 2018). Further, since we mainly focus on planets close to the radius valley, we only kept planets with radii in the range of 1–6 R. After the above selections, we are left with 446 stars hosting 621 planets (candidates). Table 1 is a rundown of the above sample selection process.

Table 1. Rundown of the Sample Selection

Selection Criteria N s N p
LAMOST-Gaia-Kepler sample (PAST II)7641060
Excluding giant stars7471040
Belong to Galactic disks659919
With [Fe/H] and [α/Fe] measurements605850
$\tfrac{{R}_{p}}{{R}_{* }}+b\leqslant 1$ 589833
Period ≤ 100 days544767
Period ≥ 1 days532753
1 R < R p < 6 R 446621

Note. N s and N p are the numbers of host stars and planets, respectively, during the process of sample selection in Section 2.1.

Download table as:  ASCIITypeset image

Figure 1 displays the color-coded distributions of relative probabilities between thick disk and thin disk (TD/D) in the [Fe/H] − [α/Fe] plane for our stellar sample. As can be seen, in general, stars with larger TD/D (kinematically thicker) have lower [Fe/H] and higher [α/Fe], which is consistent with previous studies (e.g., Bensby et al. 2014; Chen et al. 2021a).

Figure 1.

Figure 1. The [Fe/H]–[α/Fe] diagram color-coded by the relative probability between thick disk and thin disk TD/D for the stellar sample (Section 2.1).

Standard image High-resolution image

According to previous studies (e.g., Owen & Wu 2013, 2017; Fulton et al. 2017; Fulton & Petigura 2018; Zhu & Dong 2021), the radius valley is located at ∼1.9 ± 0.2 R. We then divide our planet sample into four subsamples according to their radii:

  • 1.  
    Valley planet (VP): 1.7–2.1 R (NVP = 70).
  • 2.  
    Super-Earth (SE): 1.0–1.7 R (NSE = 266).
  • 3.  
    Sub-Neptune (SN): 2.1–3.5 R (NSN = 238).
  • 4.  
    Neptune-size planet (NP): 3.5–6 R (NNP = 47).

Here NVP, NSE, NSN, and NNP are the numbers of planets in the corresponding subsamples.

Figure 2 shows the period–radius distributions of our planetary sample. As can be seen, there exists an obvious bimodal distribution in planetary radius and a valley in ∼1.9 R, which is well consistent with previous studies (e.g., Fulton et al. 2017). In the subsequent sections, we will further characterize the morphology of the radius valley and explore its dependence on various stellar properties.

Figure 2.

Figure 2. The period–radius diagram for the planetary sample (Section 2.1). Areas of different colors represent planets of different radius: green for super-Earths (1–1.7 R), gray for valley planets (1.7–2.1 R), yellow for sub-Neptunes (2.1–3.5 R), and purple for Neptunes (3.5–6 R). Histograms of planetary radius (R p ) and orbital period (P) are shown in the right panel and the top panel, respectively.

Standard image High-resolution image

2.2. Characterizing the Radius Valley Morphology

To characterize the morphology of the radius valley, we adopted a set of metrics, which are defined as follows:

  • 1.  
    The contrast of radius valley Cvalley, defined as the number ratio of super-Earths plus sub-Neptunes to the valley planets, i.e.,
    Equation (1)
  • 2.  
    The asymmetry on the two sides of the valley defined as the number ratio (in logarithm) of super-Earths to sub-Neptunes, i.e.,
    Equation (2)
  • 3.  
    The average radius of planets with size larger than valley planets, i.e.,
    Equation (3)
    where 2.1 R <R i < 6 R.
  • 4.  
    The average radius of planets with size smaller than valley planets, i.e.,
    Equation (4)
    where 1.0 R <R i < 1.7 R.
  • 5.  
    The number fraction of Neptune-size planets in the whole planet sample, i.e.,
    Equation (5)
    This metric reflects the extension degree of the second (with larger radius) peak beside the radius valley.

To obtain the uncertainties of these metrics, following Berger et al. (2020a), we performed Monte Carlo simulations by resampling the planet radii from a normal distribution given their measured values and uncertainties. We then recounted NVP, NSE, NSN, and NNP and computed those metrics, i.e., Avalley, Cvalley, Rvalley +, ${R}_{\mathrm{valley}}^{-}$, and fNP, for the resampled data. We repeated the above procedure 10,000 times and then calculated the 84.1 percentiles and 15.9 percentiles in the resample distributions as the upper and lower uncertainties of the corresponding metrics.

3. Results

In this section, we explore how the radius valley morphology (Equations (1)–(5)) depends on Galactic component (e.g., thin/thick disk; Section 3.1), kinematic age (Section 3.2), [Fe/H] (Section 3.3), and [α/Fe] (Section 3.4).

3.1. Dependence of Radius Valley on Galactic Component

In this paper, we only considered the Galactic disk components, i.e., thin and thick disks, and used the relative probability TD/D to quantify the likelihood that a star belongs to the thick disk relative to the thin disk. The higher the TD/D is, the more likely the star is to belong to the thick disk than the thin disk. We investigated the radius valley morphology as a function of TD/D. To do this, we divided our sample into four bins according to TD/D. Specifically, we first sorted the whole sample in the order in which TD/D increases. We took 10% (44) of stars at the lower TD/D end as the first bin and divided the rest into three bins of equal size (134) according to their TD/D. To remove the effects of other stellar parameters and planet detection efficiency on different bins, the last three bins were further controlled to let them have similar distributions in stellar mass, radius, and photometric precision (i.e., combined differential photometric precision (CDPP)) as compared to the first bin. This was realized by adopting the NearestNeighbors method in scikit-learn (Pedregosa et al. 2011) to select the nearest neighbor in the space of the controlled parameters from stars in the last three bins for every star in the first bin (see Appendix A for the details of the construction and validation of the control bins). After the above parameter control process, the last three bins have similar distributions in stellar mass, radius, and CDPP as compared to the first bin (Kolmogorov–Smirnov (K-S) test P-value greater than 0.8 as shown in Figure A2). We stress that this method removes the difference in sample completeness between bins, but it does not derive the intrinsic distribution in each bin by directly applying completeness corrections, so our statistical results are meaningful in the differential sense among various bins. Note that here we did not control [Fe/H] and [α/Fe] because the differences in stellar metallicity between thin- and thick-disk stars are essential and inevitable.

Figure 3 displays the period–radius diagram (top panel) and the radius distribution (bottom panel) of planets in different TD/D bins (after parameter control). As expected and printed at the top of the figure, with the increase of TD/D (i.e., from thin disk to thick disk), the kinematic age and [α/Fe] increase, while the [Fe/H] decreases. However, there seems no apparent pattern between radius valley and TD/D. Such an intuition is confirmed in Figure 4, in which we plot the five metrics (Cvalley, Avalley, ${R}_{\mathrm{valley}}^{+}$, ${R}_{\mathrm{valley}}^{-}$, and fNP, Equations (1)–(5)) of radius valley morphology as a function of TD/D. As can be seen, with the increase of TD/D, Cvalley and Avalley seem to first increase and then decrease, while ${R}_{\mathrm{valley}}^{-}$ and fNP first decrease and then increase. We have also adjusted the size of the first bin (e.g., 15%) and found that the evolution trends of the five metrics of radius valley morphology generally maintain, demonstrating that our results are not (significantly) dependent on the selection of first bins.

Figure 3.

Figure 3. The period–radius diagram (top panel) and radius distribution (bottom panel) of planets in different TD/D bins from our sample based on the LAMOST-Gaia-Kepler catalog. Different colors represent planets of different radius: green for super-Earths (1–1.7 R), gray for valley planets (1.7–2.1 R), yellow for sub-Neptunes (2.1–3.5 R), and purple for Neptune-size planets (3.5–6 R). The median values and 1σ uncertainties of kinematic age, [Fe/H], and [α/Fe] of each bin are printed at the top.

Standard image High-resolution image
Figure 4.

Figure 4. The five statistics to characterize the radius valley morphology (i.e., Cvalley, Avalley, ${R}_{\mathrm{valley}}^{+}$, ${R}_{\mathrm{valley}}^{-}$, and fNP) as functions of the relative probability between thick disk and thin disk (TD/D). In Section 4.1, we rescale the planet radii to eliminate the influence of stellar mass and planetary orbital period on the boundary of radius valley. The results with rescaling radii are plotted as open circles and shifted to the right to avoid overlapping.

Standard image High-resolution image

However, there is no clear monotonic trend between TD/D and any of the morphology metrics owing to such small number statistics. Such a nonmonotonic result is not unexpected because thick-disk stars are intrinsically older with lower (higher) [Fe/H] ([α/Fe]) compared to thin-disk stars. Thus, the results of increasing TD/D are essentially a combination of the effects of growing age, decreasing [Fe/H], and increasing [α/Fe]. In the following subsections, we will investigate these effects separately.

3.2. Dependence of Radius Valley on Age

In this subsection, we focus on the effect of age on the radius valley morphology. Similar to the parameter control method in Section 3.1, we further control [Fe/H] and [α/Fe] to isolate the effect of age as shown in Figure A3.

We estimated the average age of each bin with the kinematic method by using the age–velocity dispersion relation (AVR) refined by PAST I (Chen et al. 2021a). In Figure 5, we show the period–radius diagram (top panel) and radius distribution (bottom panel) of planets in each bin. The median values and 1σ uncertainties of kinematic age, [Fe/H], and [α/Fe] of each bin are printed at the top. From left to right, the age increases monotonically and significantly, while [Fe/H] and [α/Fe] are almost unchanged among different bins, which is expected after the above parameter control. Some apparent age trends emerge in Figure 5. As can be seen, the radius valley looks more prominent in older bins, and the number ratio of super-Earths to sub-Neptunes seems to increase with age. In addition, the radius range of planets above the radius valley is shrinking over the age.

Figure 5.

Figure 5. The period–radius diagram (top panel) and radius distribution (bottom panel) of planets with different kinematic ages t (Gyr) from our selected sample from the LAMOST-Gaia-Kepler catalog. The median values and 1σ uncertainties of kinematic age, [Fe/H], and [α/Fe] of each bin are printed at the top.

Standard image High-resolution image

We further investigate the above apparent trends quantitatively by calculating the five radius morphology metrics (Equations (1)–(5)) in each bin. Figure 6 shows these metrics, i.e., Cvalley, Avalley, ${R}_{\mathrm{valley}}^{+}$, ${R}_{\mathrm{valley}}^{-}$, and fNP, as a function of age from the top to bottom panels. For each panel in Figure 6, we fit the data points with two models, i.e., a constant model (y = C) and a linear model with a logarithm timescale ($y=A\times {\mathrm{log}}_{10}(t)+B$), or in log–log space (${\mathrm{log}}_{10}(y)=A\times {\mathrm{log}}_{10}(t)+B$). We adopted two approaches to evaluate the two models. First, the simplest one is to calculate the Akaike information criterion (AIC) score of the constant and linear models, AICcon and AIClin. In terms of the residual sum of squares (RSS), the AIC can be calculated with the following formula (Cavanaugh 1997):

Equation (6)

where n is the number of data points and k is the number of model parameters. We then calculate the difference in the AIC scores (ΔAIC ≡ AICcon − AIClin) of the two models, and the model with the smaller AIC is preferred. Second, we adopted a Monte Carlo simulation by resampling the data points given their uncertainty and refitting the data 10,000 times. The confidence level of the best-fit parameter is calculated as the fraction of these resample-fit trials in which the model has a lower AIC score. The fitting results are as follows:

  • 1.  
    For Cvalley, the linear increasing model is preferred over the constant model with a ΔAIC = 7.65 and a confidence level of 95.21% from Monte Carlo simulation. The best fit is
    Equation (7)
  • 2.  
    For Avalley, the linear increasing model is preferred over the constant model with a ΔAIC = 5.66 and a confidence level of 96.57% from Monte Carlo simulation. The best fit is
    Equation (8)
    Although we simply fit a linear function here owing to limited data points, we note that the increase of Avalley (or the number ratio of super-Earths to sub-Neptunes equivalently) seems nonlinear; it is mainly completed within ∼1–2 Gyr (see more discussions in Section 4.2.1).
  • 3.  
    For ${R}_{\mathrm{valley}}^{+}$, the linear decay model is preferred over the constant model with a ΔAIC = 12.60 and a confidence level of 99.71% from Monte Carlo simulation. The best fit is
    Equation (9)
  • 4.  
    For ${R}_{\mathrm{valley}}^{-}$, the constant model is preferred over the linear model with a ΔAIC = 2.13 and a confidence level of 99.32% from Monte Carlo simulation. The best fit is
    Equation (10)
  • 5.  
    For fNP, the linear decreasing model is preferred over the constant model with a ΔAIC = 10.12 and a confidence level of 99.85% from Monte Carlo simulation. The best fit is
    Equation (11)

Figure 6.

Figure 6. The five statistics to characterize the radius valley morphology (i.e., Cvalley, Avalley, ${R}_{\mathrm{valley}}^{+}$, ${R}_{\mathrm{valley}}^{-}$, and fNP) as functions of kinematic age t (Gyr). In Section 4.1, we rescale the planet radii to eliminate the influence of stellar mass and planetary orbital period on the boundary of radius valley. The results with rescaling radii are plotted as open circles and shifted to the right to avoid overlapping.

Standard image High-resolution image

Above quantitative results confirm the apparent trends seen in Figure 5. With increasing age over gigayear timescales, the radius valley becomes more prominent (i.e., larger Cvalley) and more asymmetric (larger Avalley or larger number ratio of super-Earths to sub-Neptunes); the average radii of planets larger than valley planets decrease (with a slope of $d(\mathrm{log}{R}_{\mathrm{valley}}^{+})/d(\mathrm{log}t)\,\sim -0.07$), while the average radii of planets smaller than valley planets (i.e., super-Earth) changes little; and the fraction of Neptune-size planets fNP decreases significantly. The implications of these results on planet formation and evolution will be discussed in Section 4.3.

3.3. Dependence of Radius Valley on [Fe/H]

In this subsection, we explore the dependence of radius valley on [Fe/H]. Similar to the parameter control method in Section 3.1, we controlled the parameters, i.e., stellar mass, radius, CDPP, [α/Fe], and TD/D, to isolate the effect of [Fe/H]. After the parameter control process, stars in different bins have similar distributions in these controlled parameters (with all K-S test p-values larger than 0.2) as shown in Figure A4. Note that we were not allowed to control stellar age directly. In fact, age has been controlled (see the ages printed at the top of Figure 7) indirectly by controlling TD/D because TD/D and age are strongly correlated to each other (Chen et al. 2021a).

Figure 7.

Figure 7. The period–radius diagram (top panel) and radius distribution (bottom panel) of planets with different metallicity [Fe/H] from our selected sample from the LAMOST-Gaia-Kepler catalog. The median values and 1σ uncertainties of kinematic age, [Fe/H], and [α/Fe] of each bin are printed at the top.

Standard image High-resolution image

In Figure 7, we compare the period–radius diagram (top panel) and radius distribution (bottom panel) of planets in different [Fe/H] bins after parameter control. As printed at the top, the median values of [Fe/H] decrease continuously, while the kinematic age and [α/Fe] are nearly unchanged within their uncertainty, demonstrating that these parameters have been well controlled. Some apparent trends emerge in Figure 7. As can be seen, the radius valley seems to be filled up as [Fe/H] decreases. There seem to be more sub-Neptunes relative to super-Earths in the Fe-rich bins than in the Fe-poor bins. In addition, the radius range of planets above the radius valley is shrinking from Fe-rich bins to Fe-poor bins.

To quantify the above trends, we further calculated the five radius valley morphologies (Equations (1)–(5)) for the four bins of different metallicity [Fe/H]. In Figure 8, we show these metrics, i.e., Cvalley, Avalley, ${R}_{\mathrm{valley}}^{+}$, ${R}_{\mathrm{valley}}^{-}$, and fNP, as a function of [Fe/H] from the top to bottom panels. We fit the data points in each panel with two models, i.e., a constant model (y = C) and a linear model (y = A × [Fe/H] + B), or in a logarithm ordinate scale ${\mathrm{log}}_{10}(y)=A\times [\mathrm{Fe}/{\rm{H}}]+B$). Then, following the same procedures as in Section 3.2, we evaluated these two models by calculating the difference in AIC score (ΔAIC) and the confidence level from Monte Carlo simulation. The fitting results are as follows:

  • 1.  
    For Cvalley, the linear increasing model is preferred over the constant model with a ΔAIC = 10.13 and a confidence level of 95.72% from Monte Carlo simulation. The best fit is
    Equation (12)
  • 2.  
    For Avalley, the linear decay model is preferred over the constant model with a ΔAIC = 10.51 and a confidence level of 99.92% from Monte Carlo simulation. The best fit is
    Equation (13)
  • 3.  
    For ${R}_{\mathrm{valley}}^{+}$, the linear increasing model is preferred over the constant model with a ΔAIC = 10.70 and a confidence level of 99.46% from Monte Carlo simulation. The best fit is
    Equation (14)
  • 4.  
    For ${R}_{\mathrm{valley}}^{-}$, the constant model is preferred over the linear model with a ΔAIC = 1.87 and a confidence level of 97.28% from Monte Carlo simulation. The best fit is
    Equation (15)
  • 5.  
    For fNP, the linear increasing model is preferred over the constant model with a ΔAIC = 4.71 and a confidence level of 95.23% from Monte Carlo simulation. The best fit is
    Equation (16)

Figure 8.

Figure 8. The five statistics to characterize the radius valley morphology (i.e., Cvalley, Avalley, ${R}_{\mathrm{valley}}^{+}$, ${R}_{\mathrm{valley}}^{-}$, and fNP) as functions of [Fe/H]. In Section 4.1, we rescale the planet radii to eliminate the influence of stellar mass and planetary orbital period on the boundary of radius valley. The results with rescaling radii are plotted as open circles and shifted to the right to avoid overlapping.

Standard image High-resolution image

Above quantitative results confirm the apparent trends seen in Figure 7. With the increase of [Fe/H] the radius valley becomes emptier; the ratio of super-Earths to sub-Neptunes decreases significantly; the average radii of planets larger than valley planets increases continuously with a slope of $d\mathrm{log}{R}_{\mathrm{valley}}^{+}/d[\mathrm{Fe}/{\rm{H}}]\sim 0.1$, while the average radius of smaller planets, i.e., super-Earth, is almost unchanged; and the fraction of Neptune-sized planets increases significantly. The implications of these results on the formation mechanisms of the radius valley will be discussed in Section 4.3.

3.4. Dependence of Radius Valley on [α/Fe]

In this subsection, we study the link between the radius valley and [α/Fe]. Similar to the parameter control method in Section 3.1, we controlled the parameters, i.e., stellar mass, radius, CDPP, [Fe/H], and TD/D, to isolate the effect of [α/Fe]. After the parameter control process, stars in different bins have similar distributions in these controlled parameters (with all K-S test p-values larger than 0.15) as shown in Figure A5.

Figure 9 shows the period–radius diagram (top panel) and radius distribution (bottom panel) of planets in different [α/Fe] bins after parameter control. As printed at the top, from left to right panels, the median values of [α/Fe] increase continuously, while the kinematic age and [Fe/H] are nearly unchanged given their uncertainties. We see an apparent decay in super-Earths accompanied by a rise in sub-Neptunes as [α/Fe] increases. This apparent trend is confirmed in the second panel of Figure 8, which shows the asymmetry of the valley Avalley as a function of [α/Fe]; a linear decay model is preferred over the constant model with a ΔAIC = 13.75 and a confidence level of 99.99% from Monte Carlo simulation. The best fit is

Equation (17)

Figure 9.

Figure 9. The period–radius diagram (top panel) and radius distribution (bottom panel) of planets with different [α/Fe] from our selected sample from the LAMOST-Gaia-Kepler catalog. The median values and 1σ uncertainties of kinematic age, [Fe/H], and [α/Fe] of each bin are printed at the top.

Standard image High-resolution image

Except for Avalley, the other morphology metrics do not show a monotonic trend with [α/Fe]. The Cvalley seems to be a constant function of [α/Fe]. For the other three metrics (i.e., ${R}_{\mathrm{valley}}^{+}$, ${R}_{\mathrm{valley}}^{-}$, and fNP ), they all seem to follow a falling−rising V-shape pattern. However, we caution that this pattern could be unphysical because there are only four data points (four bins) here, while a V-shape model needs at least four parameters. Besides the small number of bins, the total range of [α/Fe] (between −0.1 and 0.1) is also small, and the differences in [α/Fe] between adjacent bins are only ∼1σ (see the horizontal error bars in Figure 9). All these limitations add difficulty to further revealing the physical connection between radius valley morphology and [α/Fe].

In all the above investigations, we eliminate the influence of the observational bias indirectly via constructing control samples. In Appendix B, we calculated the transit detection efficiency and found that the mean detection efficiency, or completeness, does not change substantially for our controlling samples across all TD/D, age, [Fe/H], and [alpha/Fe] (as shown in Figures 10A3). Therefore, we conclude that completeness corrections are not likely to (significantly) impact the main results of our analysis. Since previous studies from observational analyses (e.g., Berger et al. 2020a; Sandoval et al. 2021) and theoretical models (e.g., Gupta & Schlichting 2020; Rogers & Owen 2021) all adopt the raw number ratio of super-Earths to sub-Neptunes, here we also adopt the raw number ratio to facilitate subsequent comparisons and discussions in Section 4.

4. Discussions

In this paper, based on the LAMOST-Gaia-Kepler catalog of PAST II (Chen et al. 2021b), we study the radius valley in the Galactic context. Specifically, we investigated the dependence of the radius valley morphology on Galactic component (TD/D), kinematic age, and metallicity ([Fe/H] and [α/Fe]). The main results are summarized in Table 2. In this section, we will discuss our results from the following aspects. First, we discuss the variation of the radius valley boundary and its effect on our results (Section 4.1). Then, we compare our results to previous relevant works (Section 4.2). Finally, we discuss the implications of our results for planet formation and evolution (Section 4.3).

Table 2. Dependence of Radius Valley Morphology (i.e., Cvalley, Avalley, ${R}_{\mathrm{valley}}^{+}$, ${R}_{\mathrm{valley}}^{-}$, and fNP; Section 2.2) on TD/D (Section 3.1), kinematic age (Section 3.2), [Fe/H] (Section 3.3), and [α/Fe] (Section 3.4)

  Cvalley (Equation (1)) Avalley (Equation (2)) ${R}_{\mathrm{valley}}^{+}$ (Equation (3)) ${R}_{\mathrm{valley}}^{-}$ (Equation (4)) fNP (Equation (5))
TD/D ↑ (Figure 4) − − − − − −
Age ↑ (Figure 6) ↑ (Equation (7)) ↑ (Equation (8)) ↓ (Equation (9)) ⇝ (Equation (10)) ↓ (Equation (11))
[Fe/H] ↓ (Figure 8) ↓ (Equation (12)) ↑ (Equation (13)) ↓ (Equation (14)) ⇝ (Equation (15)) ↓ (Equation (16))
[α/Fe] ↑ (Figure 10) ↓ (Equation (17)) − − − − − −

Note. ↑ : increasing; ↓ : decreasing; ⇝ : broadly unchanged; − − : unclear.

Download table as:  ASCIITypeset image

4.1. Boundaries of the Radius Valley

In this paper, the radius valley is nominally and simply defined at ${R}_{{\rm{p}}0}^{\mathrm{valley}}=1.9\pm 0.2\,{R}_{\oplus }$. In fact, the center of the radius valley is found to be dependent on stellar mass (Wu 2019; Berger et al. 2020a) and planetary orbital period (Van Eylen et al. 2018), which can be characterized as

Equation (18)

where P and M* are planetary orbital period and stellar mass, respectively, and $h={0.26}_{-0.16}^{+0.21}$ (Berger et al. 2020a) and $g=-{0.09}_{-0.04}^{+0.02}$ (Van Eylen et al. 2018) are the corresponding slopes.

To take the above dependence into account, following Zhu & Dong (2021), we rescaled all planet radii as

Equation (19)

where h and g were set as 0.26 and −0.09, respectively. With the rescaled radii, then the boundaries (Section 2.1) used to classify different planet populations should be updated accordingly, i.e.,

  • 1.  
    Valley planet (VP): $1.7\,{R}_{\oplus }\leqslant \tilde{{R}_{p}}\leqslant 2.1\,{R}_{\oplus };$.
  • 2.  
    Super-Earth (SE): ${R}_{p}\geqslant 1\,{R}_{\oplus }\,\mathrm{and}\,\tilde{{R}_{p}}\lt 1.7\,{R}_{\oplus };$.
  • 3.  
    Sub-Neptune (SN): $\tilde{{R}_{p}}\gt 2.1\,{R}_{\oplus }\,\mathrm{and}\,{R}_{p}\leqslant 3.5\,{R}_{\oplus };$.
  • 4.  
    Neptune-size planet (NP): 3.5 R < R p ≤ 6 R.

Note that here the boundaries of the Neptune-size planets are not affected by the change in the valley boundaries.

With the above new criteria, we recalculated the five metrics of radius valley morphology, which are plotted as hollow gray circles in Figures 4, 6, 8 and 10, respectively. As can be seen, the recalculated Cvalley, Avalley, fNP, ${R}_{\mathrm{valley}}^{+}$, and ${R}_{\mathrm{valley}}^{-}$ (open circles) agree well with our nominal results (filled points). To quantitatively characterize the influence of stellar mass and planetary period on radius valley, we also fit those gray data points with the same procedure described in Section 3. The results are all consistent with our nominal results within 1σ uncertainty.

Figure 10.

Figure 10. The five statistics to characterize the radius valley morphology (i.e., Cvalley, Avalley, ${R}_{\mathrm{valley}}^{+}$, ${R}_{\mathrm{valley}}^{-}$, and fNP) as functions of [α/Fe]. In Section 4.1, we rescale the planet radii to eliminate the influence of stellar mass and planetary orbital period on the boundary of radius valley. The results with rescaling radii are plotted as open circles and shifted to the right to avoid overlapping.

Standard image High-resolution image

Based on the above analysis, we therefore conclude that our results (e.g., Figures 4, 6, 8, and 10) are not (significantly) affected by the dependence of radius valley on stellar mass and planetary orbital period. Here we did not implement the rescaled radii from the beginning because the boundaries between sub-Earth and super-Earth, sub-Neptune, and Neptune should not be scaled. Once taking the rescaled radii, the above boundaries should be different for different planets and cannot be easily plotted in Figures 3, 5, 7, and 9.

4.2. Comparisons to Other Studies

4.2.1. Studies of Age Dependence

During the writing of this paper, we noted that some recent works have also investigated the dependence of planetary size distribution on stellar age. Berger et al. (2020b) used the isochrone age from the Gaia-Kepler stellar properties catalog and found that the number ratio of super-Earths to sub-Neptunes increases from 0.61 ± 0.09 for age <1 Gyr to 1.00 ± 0.10 for age >1 Gyr. Shortly afterward, using the isochrone age from the CKS sample, Sandoval et al. (2021) showed that the number ratio of super-Earths to sub-Neptunes rises monotonically with age, from 0.76 ± 0.08 at <1 Gyr to 0.94 ± 0.07 at >1 Gyr. More recently, David et al. (2021) reported that the radius valley appears emptier in younger systems (>2–3 Gyr) but more filled in older systems. However, such a trend is not seen in Berger et al. (2020a) or in Sandoval et al. (2021).

In this paper, we address the age dependence from a different angle of view by using the kinematic age based on the LAMOST-Gaia-Kepler catalog (Chen et al. 2021b). By defining a series of metrics, we systematically investigated how the radius valley morphology evolves with age. One of the metrics, Avalley, measuring the asymmetry on the two sides of the radius valley, is defined as the logarithm of the number ratio of super-Earths to sub-Neptunes (Equation (2)). We found Avalley ∼ 0 for age <1 Gyr, and it increases to ∼0.2 for age >1 Gyr (Figure 6), corresponding to an increase in the number ratio of super-Earths to sub-Neptunes from ∼1.0 to 1.5. The relative increase by ∼50% is a bit larger than that of Sandoval et al. (2021) but agrees well with that of Berger et al. (2020a). In addition, we note that the increase of the number ratio of super-Earths to sub-Neptunes seems mainly completed within the first ∼1–2 Gyr, which is consistent with the result of Sandoval et al. (2021, see their Figure 3). The absolute value of the number ratio of super-Earths to sub-Neptunes is systematically higher in this work. Such a systematic difference may not be unexpected, as neither the criterion to select the whole sample nor that to define the super-Earths and sub-Neptunes is the same. For example, Berger et al. (2020a) focus on stars between 5700 and 7900 K and define sub-Neptunes as planets between 1.8 and 3.5 R. In this paper, we do not adopt any stellar temperature cut, and thus our sample has relatively more cooler stars and smaller planets. We also define sub-Neptunes in a smaller radius range between 2.1 and 3.5 R. Therefore, it is expected that the number ratio of super-Earths to sub-Neptunes should be systematically higher in this work than in Berger et al. (2020a).

Like Berger et al. (2020a) and Sandoval et al. (2021), we did not see the trend either, i.e., radius valley appears emptier in younger systems than in older systems as claimed by David et al. (2021). Instead, by defining another metric CValley to measure the contrast of the radius valley, we found an opposite trend, i.e., the valley becomes more striking with increasing age (Figure 6). We note that some key differences in the methodology could be the cause of the above disagreement. In this paper, in order to isolate the age effect, we have controlled other stellar parameters to have similar distributions across different age bins (Figure A3), while such a parameter control process has not been conducted in David et al. (2021). As can be seen in Figures 2 and 3 of David et al. (2021), stars in the youngest bin are mostly confined to large metallicity ([Fe/H] > 0), while stars in the oldest bin have much broader distributions in [Fe/H]. Given the significant dependence of radius valley on [Fe/H] as shown in Section 3.3 and also in Owen & Murray-Clay (2018; see more discussion below), we therefore suspect that the age trend observed in David et al. (2021) could be affected by the effects of other stellar properties, e.g., [Fe/H].

4.2.2. Studies of Metallicity Dependence

The [Fe/H] dependence of radius distribution of small planets (<6 R) has been investigated by some previous studies. With the LAMOST spectra, Dong et al. (2018) found that, similar to hot Jupiter, hot planets (P < 10d) with radii between 2 and 6 R (dubbed as "Hoptune") are mostly singles and preferentially around stars with higher [Fe/H]. This trend was then confirmed by Petigura et al. (2018) with the Keck spectra, which further demonstrated that there is a greater diversity of planets around higher [Fe/H] stars. With the CKS sample, Owen & Murray-Clay (2018) studied the [Fe/H] dependence of radius distribution for planets around the radius valley. They found that the radius distribution of planets above the valley (≳1.8 R) is shifted to larger radii around higher [Fe/H] stars, leading to a wider radius valley around higher-[Fe/H] stars. This can be clearly seen from their Figure 2, which also reveals that the ratio of super-Earths to sub-Neptunes is smaller for higher-[Fe/H] stars. All their findings are consistent with our results (Figure 8). Their results on the radius shift of planets above the valley agree with our results on the increase in ${R}_{\mathrm{valley}}^{+}$ and fNep with [Fe/H]. Their results on the wider valley and smaller ratio of super-Earths to sub-Neptunes for higher-[Fe/H] stars agree with our results on the increase in CValley and decrease in Avalley with [Fe/H], respectively.

However, with the CKS VII sample (Fulton & Petigura 2018), recently Kutra et al. (2021) argued that the observed correlation between stellar metallicity and planetary radius could be explained by the combination of another two correlations, i.e., the stellar mass–metallicity correlation and the stellar mass–planetary radius correlation. Their result is inconsistent with ours, in which there is still significant correlation between stellar metallicity and planetary radius even after parameter control in stellar mass. In order to understand the reasons for this discrepancy, we performed the following two exercises:

  • 1.  
    We downloaded the CKS VII sample and conducted similar correlation analyses to those in Kutra et al. (2021). Figure C1 reproduces the correlations of M*–[Fe/H], M*R p , and [Fe/H]–R p as seen in Figure 1 of Kutra et al. (2021) quantitatively. Based on these correlations, Kutra et al. (2021) argue that the M*–[Fe/H] and M*R p correlations are intrinsic while the [Fe/H]–R p correlation is just a projection of the former two. Nevertheless, here we find that the situation is more complicated. As shown in Figure C1, there are more correlations that were not considered in Kutra et al. (2021), namely, the stellar age is anticorrelated with stellar mass and metallicity, which is generally expected from the theory of star formation and evolution (e.g., Bensby et al. 2014; Chen et al. 2021a). In fact, all these observed correlations are not quantitatively intrinsic because any of them is more or less affected by the others. Therefore, these apparent correlations should be taken with caution, and they are not reliable for direct use and interpretation.
  • 2.  
    Bearing this in mind, we then apply parameter control to the above CSK VII sample. Specifically, following the same procedure as in Section 3.3, we divided the whole sample into four bins according to their [Fe/H] and used the NearestNeighbor method to select the nearest neighbor in the space of the controlled parameters (i.e., stellar mass, radius, age, and CDPP) from stars in the last three bins for every star in the first bin. After such a parameter control process, stars in different [Fe/H] bins have similar distributions in mass, radius, and age (as shown in Figure C2), and as expected, the mutual correlations among mass, radius, and age become much weaker (as shown in Figure C3, all p-values are larger than 0.05). In contrast, the correlation between [Fe/H] and R p still remains as significant (p-value = 10−4) as before.

To summarize the above two exercises, we learn that the reason for the discrepancy is not in the different samples but in the different methods used between Kutra et al. (2021) and this work. The former reaches their conclusion by directly analyzing the apparent correlations, which we argue could be unreliable. Our work attempts to extract the intrinsic correlation by controlling other parameters to isolate the effect of [Fe/H].

As for [α/Fe], early studies (e.g., Zhao et al. 2002) found that giant planet host stars show a slight overabundance for some α-elements. Later, the overabundance of α-elements is found to be most striking for iron-poor regimes (Haywood 2008; Kang et al. 2011), and it is exhibited not only in stars hosting giant planets but also in stars hosting Neptune and super-Earth class planets (Adibekyan et al. 2012). Recently, Brewer & Fischer (2018) found that there is no significant difference in Mg/Si ratio between super-Earth hosts and sub-Neptune hosts. In this paper, with the LAMOST-Gaia-Kepler catalog of PAST II, we found that α-elements play an important role in shaping the radius valley, leading to a decrease in the asymmetry (${\dot{A}}_{\mathrm{valley}}\lt 0$; Figure 10), i.e., smaller number ratio of super-Earths to sub-Neptunes for stars with higher [α/Fe].

4.3. Implications for Planet Formation and Evolution

4.3.1. Radius Valley Emerged before Gyr and Evolving beyond Gyr

In this paper (Section 3.2), we investigate the temporal evolution of radial valley and find that both the contrast (Cvalley) and the asymmetry (Avalley, or the ratio of super-Earths to sub-Neptunes equivalently) of the valley increase with age (the top two panels of Figure 6). Qualitatively, our results suggest that the radius valley is evolving on gigayear timescales, which generally supports the evolutionary models driven by photoevaporation (e.g., Owen & Wu 2013; Jin et al. 2014; Lopez & Fortney 2016) and/or cooling luminosity of the planetary core (Ginzburg et al. 2016, 2018; Gupta & Schlichting 2019, 2020). Quantitatively, as we will discuss below, the radius valley morphology as a function of time (Equation (7)) may suggest that both the photoevaporation and core-powered mass-loss mechanisms should be at play, providing constraints to quantify their roles in planetary evolution.

As shown in Figures 5 and 6, the radius valley had already emerged before 0.5 Gyr. Afterward, it continues evolving beyond gigayears, with the number ratio of super-Earths to sub-Neptunes increasing by ∼50% from 0.45 to 4.58 Gyr. Neither the photoevaporation mechanism alone nor the core-powered mechanism alone can fully explain the above observational results. On one hand, although the photoevaporation mechanism can readily form a radius gap well before 0.5 Gyr, the increase in the ratio of super-Earths to sub-Neptunes is somewhat too small (only ∼20% from 0.77 ± 0.08 at t < 1 Gyr to 0.95 ± 0.08 at t > 1 Gyr as shown in Figure 16 of Rogers & Owen 2021) because the photoevaporation operates mostly within the first few hundred Myr (although see King & Wheatley 2021). On the other hand, although the core-powered mechanism operates on a much longer timescale of a few Gyr and thus causes a sufficient increase in the ratio of super-Earths to sub-Neptunes over gigayears (from 0.06 to 0.52 at ages of 0.5 and 3 Gyr by a factor of ∼9; see Figure 10 of Gupta & Schlichting 2020), the radius valley under this mechanism emerges somewhat too late (after ∼1 Gyr as shown in Gupta & Schlichting 2020). Nevertheless, a hybrid model that combines both the photoevaporation and core-powered mechanisms seems promising to fully explain our results. The photoevaporation and core-powered mechanisms could be complementary to each other, namely, the former dominates the early formation and evolution of the valley, while the latter further strengthens the long-term evolution of the valley. Future detailed studies will test and further constrain the hybrid model.

4.3.2. Constraints on Planetary Thermodynamic Evolution

Planets born with significant gaseous envelope are expected to significantly shrink in size owing to long-term thermodynamic evolution, i.e., planet cooling and contraction (Fortney et al. 2011; Linder et al. 2019). The evidence of such a thermodynamic evolution is clearly shown in our result in Figure 6. As can be seen, both the fraction of Neptune-size planets (fNP) and the average radius of planets above the radius valley (${R}_{\mathrm{valley}}^{+}$) continuously decrease with age. This result also implies that the bulk of planets above the radius valley (R p > 2.1 R) should contain significant gaseous envelope (e.g., H/He atmosphere). In contrast, planets below the radius valley (R p < 1.7 R) do not show significant change in their sizes over gigayears, which may imply that they are made of mostly bare cores (Weiss & Marcy 2014; Rogers 2015), in line with the expectation of the atmosphere escape models (Owen & Wu 2017; Jin & Mordasini 2018; Ginzburg et al. 2018).

Furthermore, we find that ${R}_{\mathrm{valley}}^{+}$ decreases with age by a slope of $d(\mathrm{log}R)/d(\mathrm{log}t)=-{0.08}_{-0.01}^{+0.01}$, which quantitatively places an observational constraint on the thermodynamic evolution of sub-Neptunes and Neptunes. For comparison, Gupta & Schlichting (2020) found a slope of $d(\mathrm{log}R)/d(\mathrm{log}t)=\sim -0.1$ from their theoretical model by including the effect of core-powered atmosphere mass loss. Nevertheless, the thermodynamic evolution depends not only on the core cooling but also on stellar irradiation, e.g., photoevaporation (Rogers & Owen 2021), planetary composition (Chen & Rogers 2016), etc. We also explored the average radii of sub-Neptunes (2.1–3.5 R) and Neptunes (3.5–6 R) separately and found that they both decrease with increasing age continuously, suggesting that the envelopes of sub-Neptunes and Neptunes all shrink with age. Future studies, by comparing our observational results to a comprehensive thermodynamic evolution model, could potentially reveal more insights on the compositions of sub-Neptunes and the roles of different mass-loss mechanisms.

4.3.3. Effects of Metallicity on Planetary Formation and Evolution

In Section 3.3, we have explored the dependence of radial distributions on [Fe/H] (Figures 78). Our results show that the ratio of super-Earth to sub-Neptunes decreases with the increase of [Fe/H]. We also find that the average radii of planets above the radius valley is larger around metal-richer stars and thus forms a wider radius valley. The number fraction of Neptune-sized planets grows with increasing [Fe/H]. Our results are also consistent with Wilson et al. (2022), which found that stellar metallicity could affect the bulk density of sub-Neptunes.

Qualitatively, the above results could be explained by the following two scenarios. First, from the view of planetary formation, planets around metal-rich stars are expected to have more massive cores and accrete more gas, forming larger sub-Neptunes and/or Neptune-size planets (e.g., Mordasini et al. 2012; Owen & Murray-Clay 2018; Emsenhuber et al. 2021). Second, from the view of planetary thermodynamic evolution, planets are expected to cool and contract slower with higher opacity of the envelope. Assuming that the opacity is correlated to the metallicity of the protoplanetary disk, as well as the [Fe/H] of the host star, one would expect that sub-Neptunes and Neptune-size planets are more frequent and larger around stars with higher metallicity at the same age (e.g., Gupta & Schlichting 2020).

Quantitatively, we find that the average radius of sub-Neptunes increases with [Fe/H] by a slope of $d\mathrm{log}{R}_{\mathrm{valley}}^{+}/d[\mathrm{Fe}/{\rm{H}}]\,={0.094}_{-0.011}^{+0.012}$, which is well consistent with the planetary cooling simulations by Gupta & Schlichting (2020), who found a slope of $d\mathrm{log}R/d[\mathrm{Fe}/{\rm{H}}]\sim 0.1$. Thus, in terms of the slope, our results provide evidence to support the second scenario. Nevertheless, this does not mean that the first scenario is ruled out. In fact, the second scenario alone may not fully explain all the observations. For instance, we note that the observed ${R}_{\mathrm{valley}}^{+}$ is symmetrically larger than the simulation results of Gupta & Schlichting (2020, see also their Figure 6). One possible explanation could be that Gupta & Schlichting (2020) did not consider the effect of [Fe/H] on the core/gas accretions and the enlargement of planetary radius (potential evidence for the first scenario) during the process of planetary formation.

As for [α/Fe] (Section 3.4), we have found that the number ratio of super-Earths to sub-Neptunes decreases significantly with increasing [α/Fe]. That is to say that sub-Neptunes are preferentially around α-rich stars. As α-element abundances provided by LAMOST are the average abundances of Mg, Si, Ca, and Ti elements, one possible explanation is that planets around α-rich stars could be born in disks with more silicates and thus have larger rocky cores to accrete more gas, forming more sub-Neptunes than super-Earths. Our result suggests that metal elements other than Fe (e.g., Mg, Si, Ca, and Ti) also play important roles in the formation of sub-Neptunes. On this basis, we predict that sub-Neptunes around α-richer stars with similar [Fe/H] will have relatively lower densities.

5. Summary

Using the LAMOST-Gaia-Kepler kinematic catalogs provided by PAST II (Chen et al. 2021b), we explored the radius valley morphology of small planets (1–6 R) in the Galactic context. Specifically, we define a set of metrics to quantify the radius valley morphology (Section 2.2) and investigate whether and how they are correlated with the properties of host stars, i.e., the relative membership probability between thick- and thin-disk stars (TD/D), the kinematic age, and metallicity, i.e., [Fe/H] and [α/Fe] (Section 3). We did not find any significant correlation between the radius valley and TD/D (Section 3.1), which is not unexpected because TD/D itself strongly depends on stellar age and metallicity, and these properties could affect the radius valley in different ways. After applying parameter control to isolate the effects of stellar age and metallicity, we then find a number of correlations, which are summarized as follows and also in Table 2.

  • 1.  
    For the dependence on age (Figures 56), we found that the radius gap emerged before 1 Gyr and continued evolving on gigayear timescales. Specifically, both the contrast and the asymmetry (or the number ratio of super-Earths and sub-Neptunes equivalently) of the radial valley increase with age (Equations (7)–(8)). The fraction of Neptune-sized planets and the average radii of planets above the radius gap (2.1–6 R) are found to decrease with age (Equations (9) and (11)), while the average radii of planets below the radius valley (<1.7 R) show no significant dependence on age (Equation (10)).
  • 2.  
    For the dependence on [Fe/H], the contrast of the radius valley increases, while the asymmetry (or the number ratio of super-Earths and sub-Neptunes equivalently) decreases with [Fe/H] (Equations (12)–(13)). The fraction of Neptune-sized planets and the average radii of planets above the radius gap (2.1–6 R) are found to increase with [Fe/H] (Equations (14) and (16)), while the average radii of planets below the radius valley ( < 1.7 R) are broadly unchanged (Equation (15)).
  • 3.  
    For the dependence on α-elements, we found that the asymmetry of the valley (or the number ratio of super-Earths and sub-Neptunes equivalently) decreases with [α/Fe] (Equation (8)). Apart from this, we did not find any significant correlation between [α/Fe] and other metrics of the valley morphology, probably because the uncertainty of [α/Fe] is relatively large compared to the range of [α/Fe] in our sample.

Our results on the long-term temporal evolution of the radius valley support the evolutionary models driven by atmospheric mass loss due to photoevaporation (Owen & Wu 2013; Jin et al. 2014) and cooling planetary cores (Ginzburg et al. 2016, 2018; Gupta & Schlichting 2019, 2020) and suggest that the radial valley is a result of planetary atmospheric loss over gigayear timescales, which is primarily formed by the photoevaporation at early stages and further strengthened by the combined effect of photoevaporation and core-powered mass loss (Section 4.3.1). The dependence of radius distribution on age also provides evidence of and constraints on planetary thermodynamic evolution, as well as insights on planetary compositions (Section 4.3.2). Planets above the radius valley (2.1 R < R p < 6 R) should be made with a significant H/He envelope, while those below the radius valley (1 R < R p < 1.7 R) are likely to be bare rocky cores. The dependence of radius distribution on metallicity suggests that not only Fe but also other metal elements (Mg, Si, Ca, Ti, etc.) play important roles in the formation and evolution of super-Earths and sub-Neptunes (Section 4.3.3).

Our work emphasizes the importance of parameter control in studying the links between various planetary properties and stellar characteristics. Future studies, both from observational analysis (e.g., very young stars with age less than 100 Myr) and from numerical simulations of various theoretical models, will test our results and further provide more insights and constraints on the formation, evolution, and compositions of exoplanets.

We thank Wei Zhu, Yanqin Wu, Fei Dai, Mao-Sheng Xiang, and James E. Owen for helpful discussions and suggestions. This work is supported by the National Key R&D Program of China (No. 2019YFA0405100) and the National Natural Science Foundation of China (NSFC; grant Nos. 11933001, 11973028, 11803012, 11673011, 11903005, 12003027). J.-W.X. also acknowledges the support from the National Youth Talent Support Program and the Distinguish Youth Foundation of Jiangsu Scientific Committee (BK20190005). D.-C.C. also acknowledges the Cultivation project for LAMOST Scientific Payoff and Research Achievement of CAMS-CAS.

This work has included data from Guoshoujing Telescope (the Large Sky AreaMulti-Object Fiber Spectroscopic Telescope LAMOST), which is a National Major Scientific Project built by the Chinese Academy of Sciences. Funding for the project has been provided by the National Development and Reform Commission. This work presents results from the European Space Agency (ESA) space mission Gaia. Gaia data are being processed by the Gaia Data Processing and Analysis Consortium (DPAC). Funding for the DPAC is provided by national institutions, in particular the institutions participating in the Gaia MultiLateral Agreement (MLA). We acknowledge the NASA Exoplanet Science Institute at IPAC, which is operated by the California Institute of Technology under contract with the National Aeronautics and Space Administration.

Appendix A: Construction and Validation of the Control Bins

The radius valley has been found to be correlated to multiple stellar properties (e.g., age, mass, [Fe/H]). These stellar properties are not independent but found to be interrelated. For example, stars with larger TD/D (kinematic age) are found to be poorer in [Fe/H] (e.g., Bensby et al. 2014; Chen et al. 2021a). The observation biases, caused by the detection efficiency, also need to be considered. For example, the stellar activity and noise are anticorrelated to TD/D and age (Figures 5 and 6 of PAST II; Chen et al. 2021b) and may affect the detection ability of planets. To control the stellar detection ability and noise level, here we adopt the Kepler DR25 CDPP, and the timescale is chosen to be 4.5 hr, as the calculated transit duration is about 4.7 hr if taking the median values of stellar mass, radius, and planetary period in our sample. For example, we divided the sample into four equal-sized bins (∼111 planets) according to TD/D and compared their cumulative distributions of stellar parameters. As shown in Figure A1, with the increase of TD/D, both the stellar radius and CDPP (i.e., noise levels) grow, thus reducing planet detection efficiencies and causing bias toward larger planets for stars with larger TD/D. This is expected, as stars with larger TD/D are more likely to be in the thick disk and thus older and more distant. Therefore, in order to isolate the influence of a single property, one needs to eliminate the influences of other properties and detection efficiency.

Figure A1.

Figure A1. The cumulative distributions of stellar mass M*, radius R*, CDPP 4.5 hr, [Fe/H], and [α/Fe] for the four bins with different relative probabilities between thick disk and thin disk TD/D (Section 3.1) before parameter controlling. Here we directly divide our sample into four samples with approximately equal size (∼111) according to TD/D. In each panel, p denotes the p-value of the two-sample K-S test for the distributions of neighboring stars belonging to the last three bins compared to the first bins.

Standard image High-resolution image

To do this, we divide the whole sample in Section 2.1 into four subsamples according to the studied property (i.e., TD/D, age, [Fe/H], and [α/Fe]). We take 10% (44) of stars with the minimum TD/D, youngest kinematic age, highest [Fe/H], and lowest [α/Fe] as the first bins for Sections 3.13.4, respectively. The subsequent samples are divided into three bins of equal size (134). Then, we construct control samples by adopting the NearestNeighbors function in scikit-learn (Pedregosa et al. 2011) to select the nearest neighbor in controlled properties for every star belonging to the first bin from stars belonging to the last three bins. In the case that a star is selected as the nearest neighbor multiple times, we exclude duplicate stars.

To examine whether the control samples have similar distributions in other stellar properties, we make K-S tests and check the resulted p-values. As can be seen in Figures A2A5, all the K-S test p-values of controlled parameters are larger than 0.15, demonstrating that these parameters have been well controlled and thus do not differ significantly in their distributions. Note that in Figure A2 we did not control [Fe/H] and [α/Fe] when exploring the dependence on TD/D because the differences in stellar metallicity between thin- and thick-disk stars are essential and inevitable.

Figure A2.

Figure A2. The cumulative distributions of stellar mass M*, radius R*, CDPP 4.5 hr, [Fe/H], and [α/Fe] for the four bins with different relative probabilities between thick disk and thin disk TD/D (Section 3.1). In each panel, p denotes the p-value of the two-sample K-S test for the distributions of neighboring stars belonging to the last three bins compared to the first bins. Note that only stellar mass M*, radius R*, CDPP, and Kepler magnitude have been controlled here. We did not control [Fe/H] and [α/Fe] when exploring the dependence on TD/D here because the differences in stellar metallicity between thin- and thick-disk stars are essential and inevitable.

Standard image High-resolution image
Figure A3.

Figure A3. Similar to Figure A2, but here showing the cumulative distributions of stellar mass M*, radius R*, CDPP 4.5 hr, [Fe/H], and [α/Fe] for the four bins with different kinematic ages (Section 3.2). Note that here stellar mass M*, radius R*, CDPP, [Fe/H], [α/Fe], and Kepler magnitude have all been controlled in order to isolate the effect of age.

Standard image High-resolution image
Figure A4.

Figure A4. Similar to Figure A2, but here showing the cumulative distributions of stellar mass M*, radius R*, CDPP 4.5 hr, TD/D, and [α/Fe] for the four bins with different [Fe/H] (Section 3.3). Note that here stellar mass M*, radius R*, CDPP, TD/D, [α/Fe], and Kepler magnitude have all been controlled in order to isolate the effect of [Fe/H].

Standard image High-resolution image
Figure A5.

Figure A5. Similar to Figure A2, but here showing the cumulative distributions of stellar mass M*, radius R*, CDPP 4.5 hr, TD/D, and [Fe/H] for the four bins with different [α/Fe] (Section 3.4). Note that here stellar mass M*, radius R*, CDPP, TD/D, and [Fe/H] have all been controlled in order to isolate the effect of [α/Fe].

Standard image High-resolution image

It is also necessary to test observational biases of the LAMOST-Gaia-Kepler sample from PAST II (Chen et al. 2021b). In Figure A6, we compare the distributions of the apparent magnitude and effective temperature of the planet host stars in the LAMOST-Gaia-Kepler sample and the entire Kepler sample. As can be seen, planet hosts in the LAMOST-Gaia-Kepler sample have relatively smaller apparent Kepler magnitude and temperature. This is not unexpected because we present the LAMOST-Gaia-Kepler sample by cross-matching Kepler target stars with Gaia and LAMOST, and LAMOST observed brighter stars preferentially, as faint stars are harder to accurately characterize. Previous studies have shown that the radius valley morphology changes with magnitude (e.g., Kepler magnitude ≲14 vs. ≳14; Fulton et al. 2017) and temperature (a proxy for mass; e.g., Berger et al. 2020a). Therefore, it is necessary to eliminate the influences of magnitudes and mass. We have already controlled the stellar mass (corresponding to temperature) and radius (which combined corresponding to magnitudes), which could eliminate their influences. For further verification and visual display, we also check the cumulative distributions of Kepler magnitude and temperature. After parameter controlling, different bins of different TD/D (age, [Fe/H], and [alpha/Fe]) do not differ significantly in the distribution of Kepler magnitude and temperature.

Figure A6.

Figure A6. The cumulative distributions of Kepler magnitudes and effective temperature for the planet host stars in the entire Kepler sample (blue) and LAMOST-Gaia-Kepler sample (red).

Standard image High-resolution image

Appendix B: Detection Efficiencies of Different Bins of Various Properties

We calculate the transit detection efficiency using the KeplerPORTs (Burke & Catanzarite 2017) and the detection metrics available from the NASA exoplanet archive (https://exoplanetarchive.ipac.caltech.edu/docs/). Figures B1, B2, B3, and B4 show the 10%, 50%, and 90% average detection efficiency contours, as well as the tranet distributions in the period–radius diagram for the four bins of different TD/D, kinematic age, [Fe/H], and [α/Fe], respectively.

Figure B1.

Figure B1. Orbital periods vs. radii for the planets in the four bins of different TD/D after parameter controlling. In each panel, we plot their average detection efficiencies (10%, 50%, and 90%) for stars in each bin (red lines). For ease of comparison, we also plot the same mean detection efficiencies for all stars in the whole sample (black lines).

Standard image High-resolution image
Figure B2.

Figure B2. Similar to Figure B1, but the four bins of different age.

Standard image High-resolution image
Figure B3.

Figure B3. Similar to Figure B1, but the four bins of different [Fe/H].

Standard image High-resolution image
Figure B4.

Figure B4. Similar to Figure B1, but for the four bins of different [α/Fe].

Standard image High-resolution image

As can be seen, for all the cases, most of the planets after parameter controlling lie above the 90% completeness curve, and nearly all planets are above the 50% completeness curve. Furthermore, for the bins of a given studied property (i.e., TD/D, age, [Fe/H], and [α/Fe]), the detection efficiencies are close to each other. Based on the above analysis, we conclude that our results will not be (significantly) changed after completeness correction.

Appendix C: Correlations between Planetary and Stellar Properties for the CKS VII Sample with and without Parameter Control

In Section 3.3, using the LAMOST-Gaia-Kepler catalog, we found that the average radii of planets larger than valley planets (2.1–6 R) increase with stellar metallicity after controlling other properties (e.g., stellar mass, age). However, with the CKS VII sample (Fulton & Petigura 2018), recently Kutra et al. (2021) showed that the observed correlation between stellar metallicity and planetary radius could be explained by the combination of the stellar mass–metallicity correlation and the stellar mass–planetary radius correlation, which is inconsistent with our results.

To understand the reasons for this discrepancy, we conducted similar correlation analyses with the CKS VII sample to those in Kutra et al. (2021). As can be seen in Figure C1, we reproduce the correlations of M*–[Fe/H], M*–[Fe/H], and M*R p as seen in Figure 1 of Kutra et al. (2021) quantitatively. Nevertheless, as shown in Figure C1, we also find more correlations that were not considered in Kutra et al. (2021): the stellar age is anticorrelated with both stellar mass and metallicity. Therefore, these apparent correlations are interrelated and should be taken with caution.

Figure C1.

Figure C1. Correlations between planetary and stellar properties for the CKS VII sample before parameter control. In each panel, ρ and p denote the Pearson correlation coefficient and p-value for each pair of variables, respectively.

Standard image High-resolution image

To isolate the effect of metallicity, following the same parameter control procedure as in Section 3.3, we divided the whole sample into four bins according to their [Fe/H] and used the NearestNeighbor method to select the nearest neighbor in the space of the controlled parameters (i.e., stellar mass, radius, age, and CDPP) from stars in the last three bins for every star in the first bin. As shown in Figure C2, the distributions in mass, radius, and age after parameter control are similar for stars in different [Fe/H] bins. Then, we compute the Pearson correlation coefficients and p-values for the mutual correlations among stellar mass, radius, age, and planetary radii. As shown in Figure C3, the correlations among stellar parameters become statistically insignificant with p-values larger than 0.05. In contrast, the correlation between [Fe/H] and R p still remains statistically significant with a p-value = 10−4, which is consistent with our results using the LAMOST-Gaia-Kepler sample.

Figure C2.

Figure C2. The cumulative distributions of stellar mass M*, radius R*, age, and CDPP 4.5 hr for the four bins with different metallicity [Fe/H] of CKS VII after parameter control. In each panel, p denotes the p-value of the two-sample K-S test for the distributions of neighboring stars belonging to the last three bins compared to the first bins.

Standard image High-resolution image
Figure C3.

Figure C3. Similar to Figure C1, but after controlling parameters (i.e., stellar mass, radius, age, and CDPP) to let them have similar distributions to those shown in Figure C2.

Standard image High-resolution image

Above exercises demonstrate that it is crucial to extract the intrinsic correlations (e.g., via parameter control) rather than using the apparent correlations among a number of planetary and stellar properties.

Please wait… references are loading.
10.3847/1538-3881/ac641f