<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Attention is still what you need: Another Round ofExploring Shoup's GGM</title></titleStmt>
			<publicationStmt>
				<publisher>Springer Nature Singapore</publisher>
				<date>12/08/2025</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10655064</idno>
					<idno type="doi"></idno>
					
					<author>Taiyu Wang</author><author>Cong Zhang</author><author>Hong-Sheng Zhou</author><author>Xin Wang</author><author>Pengfei Chen</author><author>Wenli Wang</author><author>Kui Ren</author><author>Chun Chen</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[The generic group model (GGM) is fundamental for evaluating the feasibility and limitations of group-based cryptosystems. Two prominent versions of the GGM exist in the literature: Shoup's GGM and Maurer's GGM. Zhandry (CRYPTO 2022) points out inherent limitations in Maurer's GGM by demonstrating that several textbook cryptographic primitives, which are provably secure in Shoup's GGM, cannot be proven secure in Maurer's model.In this work, we further investigate Shoup's GGM and identify novel limitations that have been previously overlooked. Specifically, to preven t generic algorithms from generating valid group elements without querying the oracle, the model typically employs sufficiently large encoding lengths. This leads to sparse encodings, a setting referred to as the sparse generic group model (sparse GGM). We emphasize that this sparseness introduces several constraints: -Groups with AE and Black-Box Separation: Shoup's GGM is typically instantiated with elliptic curve groups, which admit admissible encodings (AE)-functions mapping from Zp to elliptic curve points. We establish a black-box separation, showing that the sparse GGM fails to capture cryptographic groups that are both (1) computational Diffie-Hellman (CDH) secure and (2) compatible with admissible encodings. -Comparison with EC-GGM: We examine the relationship between the sparse GGM and the Elliptic Curve Generic Group Model (EC-GGM) introduced by Groth and Shoup (EUROCRYPT 2022), which inherently yields CDH-secure groups with admissible encodings. Within the framework of indifferentiability, we prove that EC-GGM is strictly stronger than sparse GGM. -Dense Groups and Black-Box Separation: We revisit groups with dense encodings and establish a black-box separation b etween CDH-secure dense groups and the sparse GGM.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Since the seminal work of Diffie and Hellman <ref type="bibr">[DH76]</ref>, group-based cryptosystems have become a cornerstone of modern cryptography, enabling secure key exchange, public key encryption, digital signatures, and many more. Building on this f oundation, various types of cryptographic groups have been proposed, including multiplicative groups <ref type="bibr">[DH76,</ref><ref type="bibr">ElG85]</ref>, elliptic curve (EC) groups <ref type="bibr">[Mil86,</ref><ref type="bibr">Kob87]</ref>, and pairing-friendly groups [FR94,BGOS07], all of which play crucial roles in constructing c ryptographic schemes and protocols.</p><p>In practical group-based cryptosystems, the length of group encodings plays a crucial role in efficiency, particularly affecting communication complexity. At equivalent security levels, groups with more compact encodings are typically preferred for implementation. NIST SP 800-186 <ref type="bibr">[CMR+23]</ref> lists several recommended curves with 128-bit security, including Curve 25519, an elliptic curve defined over a 255-bit prime field. With standard point compression, a group element on Curve 25519 can be encoded in 256 bits (255 bits plus 1 for sign). Because EC groups generally offer more compact encodings at the same security level, they are widely favored for practical cryptographic constructions.</p><p>Importantly, EC groups always have admissible encodings; see <ref type="bibr">[BF03,</ref><ref type="bibr">Ica09,</ref><ref type="bibr">BCI+10,</ref><ref type="bibr">LPS23]</ref>. Admissible encodings are broadly defined as efficiently computable functions that map from Z p<ref type="foot">foot_0</ref> to group elements, satisfying two key properties: regularity (having constant preimage sizes) and preimagecomputability (enabling efficient computation of all preimages for a given element in the range). These encodings enable the oblivious sampling of group elements without revealing their discrete logarithms, a critical feature for many group-based cryptosystems <ref type="bibr">[BF03,</ref><ref type="bibr">BBB+18,</ref><ref type="bibr">MR19]</ref>.</p><p>However, a fundamental limitation arises when attempting to prove the security of group-based cryptosystems-the inability to establish the unconditional hardness of the underlying computational assumptions (e.g., discrete logarithm, computational or decisional Diffie-Hellman) for any specific group. Over the past few decades, researchers have explored various approaches to establishing lower bounds on computational hardness, with the most prevalent method being the generic group model (GGM), where algorithms are limited to performing only generic group operations.</p><p>Roughly speaking, there are two widely accepted versions of t he generic group model: Maurer's GGM <ref type="bibr">[Mau05]</ref> and Shoup's GGM <ref type="bibr">[Sho97]</ref>. Maurer's GGM is modeled as a stateful system where algorithms make queries by referencing two group elements encountered during the computation (e.g., the 5th and 9th elements). In contrast, Shoup's GGM is modeled as a random injection from the additive group Z N to sufficiently long strings, where algorithms specify queries by providing two previously encountered strings from the computation.</p><p>Although both formulations appear in the literature, Maurer's GGM is generally viewed as more restrictive. In particular, digital-signature schemes are impossible in <ref type="bibr">Maurer's GGM [DHH+21]</ref>, and foundational constructions such as the Blum-Micali pseudorandom generator and the Goldreich-Goldwasser-Micali pseudorandom function cannot be realized in this model <ref type="bibr">[Zha22]</ref>. For these reasons, we focus exclusiv ely on Shoup's GGM.</p><p>In Shoup's GGM, the length of group encodings also plays a significant role. Numerous applications (e.g., <ref type="bibr">[Gro16,</ref><ref type="bibr">Zha22,</ref><ref type="bibr">HMQS23,</ref><ref type="bibr">LZ24]</ref>) within this model rely on the requirement that algorithms cannot produce valid group elements without querying the oracle. To uphold this property, the generic group model must employ sparse encodings, making the sparse GGM a critical component of Shoup's framework. How ever, sparse GGM does not support oblivious sampling, which implies that admissible encodings are effectively absent in sparse GGM!</p><p>Attention is still needed! A potential gap arises between sparse variant of Shoup's GGM and practical cryptographic groups, such as elliptic curve groups or other dense g roups. Therefore, we plan to conduct further exploration of Shoup's GGM, leading to the following research question:</p><p>Can sparse GGM effectively and accurately model elliptic curve groups or other dense cryptographic groups?</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.1">Our Results</head><p>In this work, we answer the question above negatively. Specifically, let G be a cryptographic group such that: (1) computational Diffie-Hellman (CDH) is assumed to hold with respect to G; (2) the group enco dings are dense or the group is associated with nice admissible encodings. Then G does not exist in the sparse GGM unconditionally.</p><p>In order to understand our results better, we start with a brief explanation of several important concepts including (1) sparse generic group models (GGMs), (2) dense groups, and (3) groups with nice admissible encodings. Let &#955; be the security parameter, -Sparse and dense GGMs. We say GGM G is sparse if the GGM represents a random injection from Z N to S, where N is a &#955;-bit prime, S = {0, 1} m , and the difference m&#955; satisfying m&#955; &#8805; &#969;(log &#955;). Conversely, we say the GGM G is dense if the difference m&#955; &#8804; &#920;(log &#955;).</p><p>-Sparse and dense group encodings. Similarly, we say a cryptographic group G of order N is sparse if the length of its group encodings, denoted by m, satisfies m&#955; &#8805; &#969;(log &#955;), and we say the group is dense if m&#955; &#8804; &#920;(log &#955;). -Groups with nice admissible encodings. We define a group G of order N to have a nice admissible encoding with respect to a constant and a polynomial poly AE if there exists an efficiently computable function AE : Z p &#8594; G satisfying the following conditions: (1) the preimage of any group element under AE is efficiently computable; (2) the encoding is -regular, meaning the size of any preimage set is at most ; (3) the domain size p is sufficiently large, i.e., p N &#8805; poly AE . As shown in [Ica09], all elliptic curve groups satisfy these conditions with = 4, and are therefore considered groups with nice admissible encodings.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.1.1">Impossibility Results in Sparse GGM/GBM</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Impossibility of Groups with Admissible Encodings in a Sparse GGM.</head><p>As previously discussed, the sparse GGM does not provide an admissible encoding. We argue that this limitation is an inherent drawback of the sparse GGM and proceed to establish a black-box separation between CDH-secure groups with admissible encodings and the sparse GGM.</p><p>Theorem 1.1 (Informal). For any constant , CDH-secure groups with admissible encodings with respect to do not exist in the sparse generic group model.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Impossibility of Dense Groups in a Sparse GGM.</head><p>We now turn our attention to the dense groups. Intuitively, the encodings in sparse GGM cannot be compressed in a generic manner and we demonstrate that this limitation represents another inheren t drawback of the sparse GGM and proceed to establish a black-box separation between CDH-secure dense groups and the sparse GGM.</p><p>Theorem 1.2 (Informal). CDH-secure dense groups do not exist in the sparse generic group model. Remark 1.1. In [JZW+24], <ref type="bibr">Ji et al. investigate</ref> the relationship between CDHsecure groups with varying lengths of group encodings and demonstrate that shorter CDH-secure groups are separated from longer GGMs. However, we highlight that their analysis heavily depends on the assumption that both the groups and the GGM share the same security parameter-a condition we believe is not fully justified. In contrast, our results establish the black-box separation between dense groups and sparse GGM, resolving the open problem posed in <ref type="bibr">[JZW+24]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Impossibility of Groups with Admissible Encodings or Dense Groups in a Sparse GBM.</head><p>We further argue that the inherent difficulty in constructing groups with admissible encodings or dense groups persists as long as the idealized model remains sparse, even when the model is extended to the generic bilinear group model (GBM). The GBM B models two generic groups, i.e., the source generic group and the target generic group, and we say B is sparse if both the source generic group and the target generic group are sparse. We then establish black-box separations between CDH-secure g roups with admissible encodings (CDH-secure dense groups) and the sparse GBM.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Theorem 1.3 (Informal).</head><p>For any constant , CDH-secure groups with admissible encodings with respect to do es not exist in the sparse generic bilinear group model.</p><p>Theorem 1.4 (Informal). CDH-secure dense groups do not exist in the sp arse generic bilinear group model. <ref type="bibr">Groth and Shoup [GS22]</ref> introduce a variant of the GGM, known as the elliptic curve generic group model (EC-GGM). Let E be the set of all points on an elliptic curve group of order N . The EC-GGM models a random injection from Z N to E, while preserving certain structural properties inherent t o elliptic curvesfor instance, the preservation of which points share the same x-coordinate. It follows that CDH-secure groups with nice admissible encodings exist relative to the EC-GGM.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.1.2">Exploring the Relationship Between EC-GGM and Sparse GGM</head><p>Theorem 1.5 (Informal). CDH-secure groups with nice admissible encodings exist in the elliptic curve generic group model.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Relationship Between EC-GGM and Sparse GGM.</head><p>We next study the relationship between the EC-GGM and the sparse GGM in the f ramework of indifferentiability. Following Zhang and Zhandry's analysis [ZZ23], we explore the relationship against computationally bounded adve rsary and prove that: Theorem 1.6 (Informal). In the framework of indifferentiability, EC-GGM is strictly stronger than sparse GGM.</p><p>Remark 1.2. In [JZW+24], <ref type="bibr">Ji et al. also</ref> prove that, within the framework of indifferentiability, the dense GGM is strictly stronger than the sparse GGM. However, we stress that their analysis also highly depends on the assumption that both the dense GGM and the sparse GGM share the same se curity parameter, which we believe is not fully justified. In contrast, our separation between the dense GGM and the sparse GGM is unconditional.</p><p>Theorem 1.7 (Informal). CDH-secure dense groups e xist in the dense GGM.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.2">Interpretation</head><p>Shoup's GGM serves as the foundation for the group-based cryptosystems. Our findings show that extreme caution must be taken when proving security or establishing black-box separations within Shoup's GGM. We next elaborate it from two perspectives.</p><p>Instantiating the GGM with Elliptic Curve Groups. In the generic group model, algorithms are required to be generic. However, not all algorithms for group-based assumptions, such as the discrete logarithm problem, adhere to this requirement-for example, the index calculus attack against Z * N [Adl79]. Fortunately, in the case of elliptic curve groups, the only known attacks are generic in nature. As a result, the generic group model is typically instantiated using elliptic curve groups. H owever, to the best of our knowledge, in many practical cryptographic constructions (e.g., the zk-SNARKs in <ref type="bibr">[Gro16]</ref>), the generic group model is often treated as a sparse GGM, where adversaries are restricted from obtaining valid group elements without making explicit queries. Furthermore, as shown in [Ica09], all elliptic curve groups fall into the category o f groups with nice admissible encodings.</p><p>Our results establish a black-box separation between CDH-secure groups with admissible encodings and the sparse GGM, highlighting a potential and previously overlooked gap when instantiating the GGM with elliptic curve groups.</p><p>Black-Box Separations within the GGM. The generic group model is used to demonstrate the limitations o f certain group-based cryptosystems. Notably, most known separations <ref type="bibr">[Zha22,</ref><ref type="bibr">HMQS23]</ref> are established within sparse GGM. Our findings highlight important limitations of the black-box impossibility of sparse GGM, as it excludes both CDH-secure groups with nice admissible encodings and CDH-secure dense groups. Consequently, the relativizing separation between these groups and identity-based encryption (IBE) r emains unresolved. Since many practical groups used fall under the category of groups with nice admissible encodings, our results motivate the further study of the complexity of such groups.</p><p>Furthermore, our findings indicate that, when examined in a fine-grained manner, the relationship between the GGM and the GBM must be reinterpreted. Specifically, Zhang and Zhandry [ZZ23] demonstrate that the GBM is strictly stronger than the GGM. In contrast, our results reveal that EC-GGM (or dense GGM) and sparse GBM are, in fact, incomparable.</p><p>In conclusion, our results encourage further investigation into both the construction of practical schemes and the establishment of black-box separations within the EC-GGM (or dense GGM). Additionally, our findings offer a deeper understanding of the complexities in GGMs and GBMs.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.3">Technical Ov erview</head><p>We now provide an overview of the core intuition and no velty behind our techniques.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.3.1">Impossibility of Groups with AE in Sparse GGM.</head><p>To demonstrate the impossibility of a cryptographic primitive P within an idealized model O, we typically employ the black-box separation methodology. Specifically, for any instantiation &#928; O of P, we construct a computationally unbounded adversary A such that (1) A breaks the security of &#928; O ; and (2) A is query-efficient. The seminal result of Impagliazzo and Rudich [IR89], tells that the key agreement primitive cannot exist in the random oracle model (ROM). Specifically, for any protocol in this model, there exists an adversary capable of winning in a key recovery attack (KRA), which breaks the fundamental security requirement for key agreement. Returning to our context, it is important to note that CDH-secure groups with admissible enco dings inherently imply KRA-secure key agreement (e.g., the Diffie-Hellman key exchange protocol). Therefore, if we could establish that KRA-secure key agreement is impossible in sparse GGM, the analysis would be complete.</p><p>For clarity of elaboration, we focus our analysis on a particular case of the key agreement primitive, i.e., the non-interactive key exchange (NIKE). We begin by briefly reviewing t he key concepts behind the impossibility of KRA-secure NIKE in the random oracle model [IR89,BKSY11], and then proceed to integrate those insigh ts into our analysis.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>KRA-Secure NIKE vs. ROM.</head><p>Let &#928; H := (KGen H , SHK H ) be a NIKE scheme in the random oracle model H, where (1) &#928; H achieves perfect correctness, and (2) both KGen and SHK make at most q queries. Consider Alice and Bob as two honest parties, each associated with public keys pk 1 and pk 2 , respectively. We then construct an adversary A that, given pk 1 and pk 2 , o utputs the valid shared key with a good probability. Specifically, the adversary A initializes two empty sets S que-res and S key , and then proceeds through 4q + 1 iterations of the following phases:</p><p>-Simulation. Simulate a proper view for Alice. Specifically, this view contains a set of query/response pairs SA along with a private k ey sk 1 such that: (1) SA is consistent with S que-res<ref type="foot">foot_3</ref> ; and (2) this view induces the correct public key for Alice: KGen SA&#8746;Sque-res (sk 1 ) = pk 1 . Next, compute the shared key based on the simulated view, i.e., shk = SHK &#732; SA&#8746;Sque-res (sk 1 , pk 2 ), and insert shk into the set S key .</p><p>-Update. Update all queries in SA \ S que-res by accessing the random oracle H, and add the corresp onding query/response pairs into the set S que-res .</p><p>Finally, A outputs the majority of the shared keys f rom the set S key . Next, we outline the key intuition behind why A is likely to win in the KRAsecure game with a good probability. Let S Bob denote the set of query/response pairs made by B ob during the real execution of the key exchange protocol. For any iteration, there are two cases:</p><p>Observe that case 1 can occur in at most 2q iterations, as |S Bob | &#8804; 2q. Once this case takes place, the update phase will absorb at least one pair (que B , res B ) &#8712; S Bob into S que-res . For case 2, we note that when it occurs, there exists an alternate oracle H that remains consistent with both SA and S Bob . Due to p erfect correctness, the shared key computed during that iteration is guaranteed to be valid. Furthermore, since case 2 occurs in at least 2q + 1 iterations, this ensures that the majority of keys in S key are valid. However, in the case of GGM, the attack fails immediately, as the GGM itself implies the KRA-secure N IKE. Next, we explain the reasoning behind the failure of the attack.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Why the Attack Fails in GGM?</head><p>In contrast to the ROM, the GGM features two types of queries: labeling queries (e.g., x, G(x)) and addition queries (e.g., G(x), G(y), G(x + y)). Hence, to guarantee the validity of the shared key, we should define S Bob to include all group encodings p resent in the queries (both labeling and addition) along with their corresponding discrete logarithms (which Bob may not know). Consequently, for any iteration, there are three cases:</p><p>-Case 1 (Bad):</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>we have that if</head><p>que A = que B then res A = res B , and vice versa.</p><p>Observe that case 1 and case 3 can be handled similarly as above. However, case 2 may always occur, meaning it is impossible to find a GGM instance that is consistent with both SA and S Bob . This implies the aforementioned attack fails.</p><p>Upon further exploration, we observe that the occurrence of case 2 indicates that the adversary is able to generate a valid group element without knowing its discrete logarithm, i.e., without making labeling queries. Therefore, to advance the analysis, it is necessary to handle the NIKE and GGM with a more finegrained approach, ensuring that no query-efficient adversary can generate a valid group element without knowing its discrete logarithm. Specifically, the analysis aims to identify a hard problem related to the fine-grained NIKE and GGM, and prove that if an adversary, given the public keys pk 1 and pk 2 , succeeds in outputting a valid group element without making queries, then the hard problem does not hold anymore.</p><p>Solution in [JZW+24] and Its Limitations. <ref type="bibr">Recently,</ref><ref type="bibr">Ji et.al. [JZW+24]</ref> demonstrated that CDH-secure cryptographic groups with shorter group descriptions cannot exist in the generic group model with longer encodings. Specifically, they establish a somewhat black-box separation b etween KRA-secure NIKE schemes associated with shorter public keys and the GGMs with longer group encodings 3 . The key idea in their work is that, given the public keys pk 1 and 3 The formal primitive can be trivially constructed via shorter CDH-secure groups.</p><p>pk 2 , the extracted group element str can fall into one of two cases: (1) str is related to both pk 1 and pk 2 ; or (2) str is related to only one of the public keys but independent of the other. In the former case, Ji et.al. explain that str is typically a frequent query, which can be easily handled by repeatedly running the key generation algorithm on sufficiently many random inputs. In the latter case, where str is only relevant to only one public key (e.g., pk 1 ), they observe that pk 1 is the sole carrier of information about str. However, since the length of pk 1 is significantly shorter than that of str, it lacks the necessary capacity to encode all the information required for the recovery of str. Consequently, no query-efficient algorithm can extract str with a non-negligible probability.</p><p>Furthermore, the hard problem identified in [JZW+24] can be interpreted as follows: given any public key (which is shorter), no query-efficient algorithm can extract a valid group element (which is longer) except with negligible probability.</p><p>However, the hard problem in [JZW+24] faces an inherent limitation. Specifically, it holds only if the length of pk 1 is significantly shorter than that of str. This condition is guaranteed when both the KRA-secure NIKE and the GGM share the same security parameter. However, if NIKE and the GGM are associated with different security parameters, this condition may no longer hold. Exploring deeper, we observe that the reason for this limitation lies in the fact that "shorter" and "longer" are relative terms with respect to the security parameter. A public key considered "shorter" under a large security parameter may still carry enough information to extract a "longer" GGM encoding when the security parameter is small. Therefore, we argue that the hard problem in [JZW+24] is based on an unjustified assumption, and their analysis deviates from the conventional black-box separation<ref type="foot">foot_4</ref> .</p><p>Our Techniques. To establish a black-box separation between groups with nice admissible encodings and sparse GGMs, it is essential to identify a new and unconditional hard problem. We leverage the sparseness property of the GGM and define the hard problem as follows: for any query-efficient algorithm with access to the sparse GGM, which only receives the security parameter as input, it cannot output a new and valid group element except with negligible probability<ref type="foot">foot_5</ref> . By saying that a group element str is new, we mean that: (1) str has been used as an input in some addition queries made by t he algorithm, and (2) str has not appeared as the output of any prior queries. As discussed in <ref type="bibr">[Zha22]</ref>, the hardness of this problem holds unconditionally. Next we elaborate on the high-level strategy for int ergrating this hard problem into the black-box separation.</p><p>Roughly speaking, we show that if the adversary A, given inputs pk 1 and pk 2 , is able to generate a new group element with a good probability, then we can construct an extractor E which, given only the security parameter as input, can also output a new group element with a good probability.</p><p>The first technical challenge of our analysis is determining how the extractor E generates the tw o public keys. A natural approach would be:</p><p>Step 1: E randomly selects two private keys (sk 1 and sk 2 ), computes the corresponding public keys (pk i = KGen G (sk i )), and runs A G (pk 1 , pk 2 ).</p><p>Step 2: Whenever A issues an addition query using a new group element (denoted as str) as input, E outputs str.</p><p>Unfortunately, this approach fails immediately. Given the fact that the GGM is sparse, any new group element str generated by A should be related to either pk 1 or pk 2 . More precisely, str is likely to appear during the execution of KGen G (sk 1 ) or KGen G (sk 2 ) with high probability. As a result, if following the aforementioned approach, then the extractor would fail to output a new group element, even if A does so.</p><p>To overcome this challenge, we must adopt an alternative method for generating the public keys, leveraging admissible encodings as part of the solution. Specifically, we configure the NIKE as a KRA-secure NIKE with nice admissible encodings, meaning there exists an admissible encoding that maps from Z p to the public keys, where p is sufficiently large. Clearly, CDH-secure groups with nice admissible encodings implies such a NIKE. We then construct the extractor E as follows:</p><p>Step 1: E randomly selects two seeds (seed 1 and seed 2 ), computes the corresponding public keys via admissible encodings (pk i = AE G (seed i )), and runs A G (pk 1 , pk 2 ).</p><p>Step 2: Whenever A issues an addition query using a new group element (denoted as str) as input, E outputs str.</p><p>At this point, the second technical challenge emerges. Specifically, let S AE denote the set of the query/response pairs that arise during the execution of pk 1 = AE G (seed 1 ) and pk 2 = AE G (seed 2 ). Note that if the new group element generated b y the adversary A consistently falls within the set S AE , the extractor above will still fail to output a new group element.</p><p>To address this challenge, we refine the adversary's strategy. Specifically, we introduce a prepro cessing phase for the adversary, define as follows:</p><p>-Preprocessing. Given the inputs pk 1 and pk 2 , the adversary A runs the inverse of the admissible encodings, computing seed i = inv-AE G (pk i ). Subsequently, A recalculates pk i = AE G ( seed i ) and collects all associated query/response pairs into the set S que-res .</p><p>The adversary then proceeds as follows: before entering the iteration phase, it first executes the preprocessing phase.</p><p>Next, we explain why the preprocessing phase is effective. It is important to note that, following the preprocessing phase, the adversary A has already gathered all the query/response pairs contained in S AE . As a result, whenever A generates a new group element, it will also be new to E.</p><p>The above outline is not comprehensive; for detailed tec hnical explanations, please refer to Sect. 3.</p><p>The key idea of our analysis is that valid public keys can be generated through two distinct methods. In the context of NIKE with dense encodings, there are likewise two methods for generating public keys, one of which even does not require making any queries. Thus, our analysis can be naturally extended to establish a black-box separation between CDH-secure dense groups and sparse GGM. Furthermore, the hard problem above remains valid within the sparse generic bilinear group model (GBM), allowing all impossibility results to naturally extend to the sparse GBM as well.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.3.2">EC-GGM vs. Sparse GGM</head><p>To demonstrate that the elliptic curve generic group model (EC-GGM) is strictly stronger than the sparse generic group model (sparse GGM), we frame our goal within framework of indifferentiability. In particular, we establish t hat EC-GGM (denoted as ec-G) statistically implies sparse GGM (denoted as G); while the sparse GGM does not computationally imply EC-GGM.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>EC-GGM Statistically Implies Sparse GGM.</head><p>We begin by explaining how ec-G implies G against statistical adversaries. Let ec-G = (ec-G L , ec-G A ) denote an EC-GGM corresponding to an elliptic curve defined over Z p , where each group element is represented as (u, b) &#8712; Z p &#215; {0, 1}. According to the Hasse bound, the encoding of EC-GGM is inherently dense. To extend the encoding length to m bits for the sparse GGM, we in troduce an additional oracle, the random permutation Perm over {0, 1} m , along with its inverse, Perm Inv . The labeling function is then defined as follows:</p><p>The addition algorithm is constructed by applying the inverse oracle Perm Inv . To demonstrate indifferentiability, we need to construct a simulator S that, with access to G, can accurately simulate both ec-G and (Perm, Perm Inv ).</p><p>To simulate ec-G, the simulator maintains a tabel to record the correspondence between the elliptic curve points P := (u, b) and str := G(x). An important property of the EC-GGM is that for any (u, b) &#8592; ec-G L (x) and (u , b ) &#8592; ec-G L (-x), it holds that u = u and b = 1b . Consequently, whenever the simulator records the tuple (P, str), where P := (u, b) and str := G(x), it additionally records (-P, G(-x)) with -P := (u, 1b). Notably, G(-x) can be obtained by making additional queries to G on str, even without knowledge of x.</p><p>To simulate Perm, the simulator S first checks whether the query corresponds to an elliptic curve point and whether there exists a corresponding G(x). If both conditions are satisfied, S just responds with G. If not, S responds with G(x) for a randomly sampled x &#8712; Z N or with a randomly chosen invalid string str, depending on whether the query corresponds to a valid point. The simulation of Perm Inv follows a similar strategy.</p><p>Remark 1.3. Careful readers might argue that, within the elliptic curve generic group model, adversaries could perform certain non-generic operations, such as oblivious sampling. As a result, EC-GGM extends the adversaries' capabilities, necessitating a reinterpretation of the hardness of certain security assumptions (e.g., DLOG or CDH).</p><p>Fortunately, due to statistical indifferentiability, the hardness of the CDH assumption remains intact. Specifically, we demonstrate that if a statistical yet query-efficient adversary exists that can break CDH in EC-GGM, we can construct a differentiator capable of breaking indifferentiability. More precisely, the differentiator acts as the challenger in the CDH game, interacts with the adversary, and distinguishes between the real and ideal worlds by observing whether the adversary successfully wins the CDH game.</p><p>Next, we demonstrate that even in the context of computationally bounded adversaries, it is not possible to construct an indifferentiable EC-GGM within a sparse GGM.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Sparse GGM Does Not Computationally Imply EC-GGM.</head><p>Suppose we have a proposed construction of an elliptic curve generic group model derived from a sparse GGM, denoted as &#928; G := (L G , A G ). How can we demonstrate that a computationally bounded adversary is able to differentiate &#928; G from ec-G? A common approach is to identify a security game that remains secure in ec-G but can be easily broken in any construction of &#928; G .</p><p>Following the strategy outlined in [ZZ23], we define the security game as discrete logarithm identification (DLI), a variant of the discrete logarithm problem. Intuitively, the DLI game involves the following: given str := L(x), construct a probabilistic, efficient, and query-free circuit C such that C(x) accepts with high probability, while C(x ) overwhelmingly rejects for all x = x.</p><p>Next, we briefly recall the techniques from [ZZ23], where Zhang and Zhandry establish a separation between GGM and ROM against computationally bounded adversaries. Clearly, DLI is secure in GGM. To establish the separation, Zhang and Zhandry show that DLI can be easily broken in any group constructed from the ROM. In a nutshell, the adversary constructs the circuit C as follo ws: C(&#8226;) functions identically to L H (&#8226;), but without access to H, and accepts the input if the reconstructed result matches L H (x) (hardwired in C).</p><p>The key idea in [ZZ23] is to anticipate the queries that C will make to the random oracle model. This enables the adversary to make all sensitive queries in advance and hardcode the corresponding query/response pairs into C , thereby creating an oracle-free circuit. Zhang and Zhandry demonstrate that, with high probability, all sensitive queries can be collected as follows:</p><p>2. execute L H (xr) &#8592; A H (L H (x), L H (-r)) and A H (L H (xr), L H (r)), and collect all the queries into S que-res .</p><p>Next, we outline our approach for the incorporating aforementioned technique into the analysis within sparse GGM. Specifically, given an input L G (x), the adversary A collects all the queries as described above, hardwires both the query/response pairs and L G (x) into the circuit C, which operates identically to L G (&#8226; ), and then outputs C(&#8226;). We then analyze the probability of A wins. Let Q x be the set of all query/response pairs generated during the execution of L G (x)<ref type="foot">foot_7</ref> , for each pair (que, res) &#8712; Q x , there are four possible cases:</p><p>-Case 1: The label L G (x) does not depend on res at all; -Case 2: The label L G (x) depends on res, but res does not appear in the response when computing A G (L G (xr), L G (r)); -Case 3: The label L G (x) depends on res, which is obtained by making "labeling" query to G when computing A G (L G (xr), L G (r)); -Case 4: The label L G (x) depends on res, which is obtained by making "addition" query to G when computing</p><p>Next, we present the analysis for handling eac h of the four cases.</p><p>For (que, res) in case 1 (same as in [ZZ23]), referred to as a non-sensitive query, since L G (x) does not depend on res, we can replace the resp onse to que with a uniformly sampled string without affecting the final outcome.</p><p>For (que, res) in case 2 (same as in [ZZ23]), referred to as a sensitive but frequent query, since the computation of A G (L G (xr), L G (r)) does not obtain res by making queries, res must be extracted from the inputs, i.e., L G (xr) and/or L G (r). This indicates that, with high probability, (que, res) &#8712; Q x &#8745;(Q x-r &#8746; Q r ). Moreover, since x, xr and r are pairwise indep endent, which means that Q x &#8745; Q x-r and Q x &#8745; Q r only contains "frequent" queries. Therefore, this query/response tuple can be collected by running L G (&#8226;) on sufficiently many random inputs.</p><p>For (que, res) in case 3 (same as in [ZZ23]), referred to as a sensitive labeling query, we can collect all labeling queries made during the computation of</p><p>For (que, res) in case 4 (different from [ZZ23]), referred to as a sensitive addition query, where que = (str 1 , str 2 ) consists of two valid group elements. Although res appears in this query, collecting this kind of query is not usually useful for our purpose. Specifically, when executing L G (x), the algorithm may only issue labeling queries at points (x 1 , . . . , x q ), whereas S que-res might only contain query/response pairs in the form of addition, such as (G(y i ), G(z i ), G(y i + z i )), without explicitly knowing y i or z i . Consequently, during the reconstruction of L G (x), the algorithm may issue labeling queries at certain points, but C is unable to identify which tuple corresponds to the correct response, resulting in the reconstruction failure.</p><p>To overcome this technical challenge, we once again leverage the sparseness property of the GGM. Note that the occurrence of the query in case 4 indicates that the adversary is able to generate a new and valid group element without knowing its discrete logarithm. Thus, by applying the same analysis as outlined in Sect. 1.3.1, we can construct an extractor that, given only the security parameter as input, is able to produce a new and valid group element whenever the query in case 4 occurs. This demonstrates that the query in case 4 occurs with negligible probability.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Preliminaries</head><p>Notations. In this paper, let &#955; &#8712; Z denote the security parameter. We use x||y to represent the concatenation of the strings x and y. For a finite set S, we denote a random sample s from S according to the uniform distribution as s $ &#8592; S, and the size of S as |S|. For a probabilistic algorithm Alg, we overload the notation and w rite y $ &#8592; Alg(I) to denote that the variable y is obtained by running Alg on input I, where I may be a tuple I = (I 1 , ..., I n ). If Alg is deterministic, we use the notation "&#8592;" instead of " $ &#8592;". A positive function negl(&#8226;) is said to be negligible if, for all positive polynomial poly(&#8226;), there exists a constant &#955; 0 &gt; 0 such that for all &#955; &gt; &#955; 0 , it holds that negl(&#955;) &lt; 1/poly( &#955;). We say that a function &#961;(&#8226;) is noticeable in &#955; if its inverse, 1/&#961;(&#955;), is polynomial in &#955;.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Groups with Admissible E ncodings</head><p>In this work, we treat groups with admissible encodings as a cryptographic primitive. We first recall the formal definition of a primitive given by [RTV04].</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Definition 2.1 (Cryptographic Primitive [RTV04]). A primitive P is a pair</head><p>, , where F is a set of functions f : {0, 1} * &#8594; {0, 1} * specifying correctness property, and R is a relation on pairs f, , with f &#8712; F and A an adversarial machine, specifying security property.</p><p>-Efficient implementation. A function f is said to implement P or be an implementation of P if f &#8712; F. A n efficient implementation of P is an implementation of P that is computable in polynomial time. -Secure implementation. An adversarial machine A is said to P-break f &#8712; F if f, . A secure implementation of P is an implementation of P such that no ppt adversarial machine can P-break f . We say that the primitive P exists if there is an efficient and secure implementation of P.</p><p>To ensure correctness, we consider cryptographic groups that support an efficiently computable encoding from Z p (for some integer p) into the group. Various formulations of such admissible encodings appear in the literature <ref type="bibr">[BF03,</ref><ref type="bibr">BCI+10,</ref><ref type="bibr">LPS23]</ref>; in this work, we adopt the definition from <ref type="bibr">[LPS23]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Definition 2.2 (Admissible Encodings). A function AE : S T between two finite sets is an admissible encoding w ith respect to if it satisfies the following conditions:</head><p>Computable: The function AE is computable in polynomial time. Sampleable: Given any element t in the image of AE, one can efficiently compute its full preimage inv-AE(t).</p><p>-regular: For any t &#8712; T , the size of the preimage |inv-AE(t)| is at most , where is a samll constant.</p><p>Remark 2.1. The definition of admissible encodings in [BCI+10] differs slightly from that in Definition 2.2. Specifically, it requires that for uniformly distributed inputs s &#8712; S, the output distribution AE(s) is statistically close to uniform over T . However, most known admissible encodings-such as Icart's encoding [Ica09, FT10]-do not produce uniformly distributed outputs. In this w ork, we adopt the definition from <ref type="bibr">[LPS23]</ref>, and note that the formulation in [BCI+10] can be seen as a special case of Definition 2.2 with = 1 .</p><p>When working with admissible encodings for cryptographic groups, the target set T is typically identified with the set of group elements. A key application of admissible encodings is oblivious sampling, which requires that the encoding's image covers a noticeable fraction of the set T . This, in turn, requires the domain to b e sufficiently large. Specifically, we say an admissible encoding AE for a group G is nice if: (1) AE maps from Z p to G, and (2) the ratio p |G| is not small. As shown in [Ica09], all elliptic curve groups admit nice admissible encodings. We now formalize the notion of CDH-secure groups equipped with nice admissible encodings.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Definition 2.3. (CDH-Secure Group with Nice Admissible Encodings).</head><p>The CDH-secure group with nice admissible encoding with respect to a constant and a polynomial poly AE , denoted ae-P CDH , is defined as a pair ae-F CDH , ae-R CDH , satisfying the following conditions:</p><p>1. The set ae-F CDH consists of group-generation functions f that, on input a security parameter, f outputs the description of a finite cyclic group along with an additional function. Specifically, we write (G, g, N,</p><p>where G is a cyclic group of &#955;-bit prime order N , g is a generator of G, and AE : Z p G is a -regular admissible encoding, with p &#8805; N/poly AE (&#955;).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">The pair f,</head><p>belongs to ae-R CDH for a function f &#8712; ae-F CDH and a ppt adversarial machine A if there exists a polynomial poly A (&#8226;) such that</p><p>for infinitely many values of &#955;, where (G, g, N, AE) &#8592; f (1 &#955; ) and x 1 , x 2 are chosen uniformly from Z N .</p><p>We say that a CDH-secure group with nice admissible encoding ae-P CDH exists if there is a function f &#8712; ae-F CDH such that no ppt adversarial machine A satisfies f, ae-R CDH .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Dense Groups</head><p>We also consider cryptographic groups with dense encodings and formalize this primitive following the definition style in</p><p>[RTV04]. Definition 2.4. (CDH-Secure Dense Group). The CDH-secure dense group, denoted d-P CDH , is defined as a pair d-F CDH , d-R CDH , satisfying the following conditions: 1. The set d-F CDH consists of group-generation functions f that, on input of a security parameter &#955;, output the description of a finite cyclic group with a dense encoding. Specifically, we write (G, g, N, m) &#8592; f (1 &#955; ), where G is a cyclic group of &#955;-bit prime order N , g is a generator of G, and m is the length of the group encoding. Here, m satisfies mlog N &#8804; &#920;(log &#955;), meaning that each group element in G can be represented as an m-bit string. 2. The pair f, belongs to d-R CDH for a function f &#8712; d-F CDH and a ppt adversarial machine A if there exists a polynomial poly A</p><p>for infinitely many values of &#955;, where (G, g, N, m) &#8592; f (1 &#955; ) and x 1 , x 2 are chosen uniformly from Z N .</p><p>We say that a CDH-secure dense group d-P CDH exists if there is a function</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3">Idealized Mo dels</head><p>In this subsection, we introduce the idealized models discussed in this pap er, including the Generic Group Model (GGM) <ref type="bibr">[Sho97]</ref>, the Generic Bilinear Group Model (GBM) <ref type="bibr">[BB04]</ref>, and the Elliptic Curve Generic Group Model (EC-GGM) [GS22]. In each model, all entities, including the adversary and the challenger, have access to the corresponding oracle. Below, we specify the behavior of the oracle in each idealized model. In this work, we further say that the generic group model G N,m is dense if mlog N &#8804; &#920;(log &#955;). Otherwise, we refer to it as sparse.</p><p>Definition 2.6. (Generic Bilinear Group Model <ref type="bibr">[BB04]</ref>). For a given security parameter &#955; &#8712; N, let I ZN ,Si denote the set of all injections from Z N to S i , where N is a &#955;-bit prime number and S i = {0, 1} mi for i &#8712; {1, 2, T}.</p><p>The generic bilinear group model B is an idealized model that samples random injections &#963; 1 , &#963; 2 , and &#963; T from I ZN ,Si , respectively, and provides seven oracles: B 1-label , B 1-add , B 2-label , B 2-add , B T-label , B T-add and B map . Concretely, -The labeling oracle B i-label , for i &#8712; {1, 2, T}, takes as input x &#8712; Z N and returns &#963; i (x); -The addition oracle B i-add , for i &#8712; {1, 2, T}, takes as input strings str, str &#8712; S i , and behaves as follows: if there exist x, x &#8712; Z N such that &#963; i (x) = str and &#963; i (x ) = str , the oracle returns &#963; T (x + x ); otherwise, it returns &#8869;. -The bilinear map oracle B map takes as input strings str 1 &#8712; S 1 and str 2 &#8712; S 2 , and behaves as follows: if there exist x 1 , x 2 &#8712; Z N such that &#963; 1 (x 1 ) = str 1 and &#963; 2 ( x 2 ) = str 2 , the oracle returns &#963; T (x 1 &#8226; x 2 ); otherwise, it returns &#8869;.</p><p>In this work, we consider only symmetric generic bilinear group models, meaning that B 1-label = B 2-label and B 1-add = B 2-add . We make the functions N , m S and m T explicit<ref type="foot">foot_8</ref> , and refer to the model as</p><p>We </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4">Indifferentiability</head><p>The framework of indifferentiability is proposed by M aurer, Renner, and Holenstein [MRH04], which formalizes a set of necessary and sufficient conditions for securely replacing one cryptosystem with another in an arbitrary environment. This framework is used to justify the structural soundness of various cryptographic primitives, including hash functions [CDMP05,DRS09], block ciphers [ABD+13,CHK+16,DSSL16,GWL23], domain extenders [CDMS10], authenticated encryption with associated data <ref type="bibr">[BF18]</ref>, and public-key cryptosystems <ref type="bibr">[ZZ20]</ref>. It can also be used to study the relationship b etween idealized models [ZZ23]. In the following, we proceed to the definition of indifferentiability:</p><p>A cryptosystem &#931; consists of a set of algorithms. Here, &#931; is accessible via two interfaces &#931;.hon and &#931;.adv, where &#931;.hon provides an honest interface through which the system can be a ccessed by all parties in a black-box manner, and &#931;.adv models the adversarial access to &#931;.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Definition 2.8 (Indifferentiability [MRH04]</head><p>). Let &#931; 1 and &#931; 2 be two cryptosystems and S be a simulator. The indifferentiability advantage of a differentiator D against (&#931; 1 , &#931; 2 ) with respect to S is</p><p>where games Real &#931;1,D and Ideal &#931;2,S,D are defined in Fig. <ref type="figure">1</ref>. We say &#931; 1 is indifferentiable from &#931; 2 , if there exists an efficient simulator S such that for any efficient differentiator D, the advantage above is negligible. Moreover, we say &#931; 1 is statistically indifferentiable from &#931; 2 , if there exists an efficient simulator such that, for any unbounded differentiator D, the advantage above is negligible. Below, we also use the notations in <ref type="bibr">[BF18]</ref> and consider the definition above to two systems with interfaces as:</p><p>where F 1 and F 2 are two ideal objects sampled from their distributions and &#928; F1 is a construction of F 2 by calling F 1 . MRH prove the composition theorem for the framework of indifferentiability; for simplify, we give a game-based formalization from [RSS11].</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Theorem 2.1 (Composition Theorem [MRH04]</head><p>). Let &#931; 1 := (&#928; F1 , F 1 ) and &#931; 2 := (F 2 , F 2 ) be two systems that &#931; 1 is indifferentiable from &#931; 2 with respect to a simulator S, then &#931; 1 is as secure as &#931; 2 for any single-stage game. More concretely, let Game be a single-stage game, then for any adversary A, there is an adversary B and a differentiator D such that</p><p>The proof of Thm. 2.1 is straightforward; due to space limit, we skip it here. Next, we give the formal definition of the separation between tw o idealized models in the framework of indifferentiability against computational adversaries.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Definition 2.9 (Computational Indifferentiable Separation [MRH04]).</head><p>Let &#931; 1 , &#931; 2 be two idealized models, we say &#931; 2 is computationally indifferentiably separated from &#931; 1 if for any efficient algorithm &#928; and any efficient simulator S, there exists an efficient differentiator D &#928;,S and a noticeable function &#961; such that</p><p>Observe that, if an idealized model &#931; 2 is computationally indifferentiably separated from another idealized model &#931; 1 , it means that, we cannot build a scheme &#928; &#931; 1 such that &#928; &#931;1 is indifferentiable from &#931; 2 , even under arbitrarily strong computational assumptions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Impossibility of Groups with Admissible Encodings</head><p>In this section, we elaborate the main constraint of the sparse GGM/GBM. We demonstrate that any the cryptographic group with nice admissible encodings that is CDH-secure, denoted as ae-P CDH , cannot exist within the sparse GGM/GBM. Roughly speaking, to establish the black-box separation, the standard approach is t o build an adversary that, while computationally unbounded, is query-efficient and capable of breaking the CDH game with respect to any construction of ae-P CDH in the sparse GGM/GBM.</p><p>For easier readability, our strategy follows the one in [IR89]. We introduces an intermediary primitive, i.e. non-interactive key exchange (NIKE) with nice admissible encoding that is secure against key-recovery attack (KRA-secure), denoted as ae-P NIKE , and prove that:</p><p>ae-P CDH implies ae-P NIKE ; -ae-P NIKE does not exist in the sparse GGM/GBM.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Non-interactive Key Exchange with Nice AE</head><p>In this section, we present the formal definition of the KRA-secure noninteractiv e key exchange with nice admissible encoding.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Definition 3.1 (NIKE with Nice AE).</head><p>A KRA-secure non-interactive key exchange protocol with nice admissible encoding, denoted ae-P NIKE , is define d as a pair ae-F NIKE , ae-R NIKE :</p><p>1. The set ae-F NIKE consists of functions f , each associated with a constant and a polynomial poly AE in &#955;. For a given input security parameter, f outputs the description of a non-interactive key exchange protocol along with an additional function. Specifically, w e write (KGen, SHK, AE) &#8592; f (1 &#955; ), where algorithms are associated with SK, PK, and K.</p><p>-SK, PK, and K are the private-key space, public-key space and shared-key space, respectively, satisfying that SK := Z N , where N is a &#955;-bit integer. -The public-key generation function KGen : SK &#8594; PK is an injection, for generating a public key pk &#8712; PK from a randomly chosen private key sk &#8712; SK. -The shared-key generation function SHK : PK&#215;SK &#8594; K&#8746;{&#8869;} for generating a shared key shk &#8712; K&#8746;{ &#8869;}, where &#8869; indicates a failed computation. -The encoding function AE : Z p &#8594; PK is a -regular admissible encoding from Z p to the codomain o f KGen, with p &#8805; N/poly AE (&#955;).</p><p>Concretely, for randomly chosen sk $ &#8592; SK and sk $ &#8592; SK, compute pk &#8592; KGen(sk) and pk &#8592; KGen(sk ). We write shk &#8592; SHK(pk , sk) and shk &#8592; SHK(pk, sk ). The protocol is required t o achieve perfect correctness, meaning that:</p><p>2. For a function f = (KGen, SHK, AE) &#8712; ae-F NIKE and a ppt (adversarial) machine A, we define f, ae-R NIKE if A can break the security property of f against key-recovery attack (KRA). Specifically, there exists a polynomial poly A (&#8226;) such that:</p><p>for infinitely many values of &#955;. Here, for randomly chosen sk $ &#8592; SK and sk $ &#8592; SK, compute pk &#8592; KGen(sk) and pk &#8592; KGen(sk ), respectively.</p><p>We say ae-P NIKE exists if there is a function f &#8712; ae-F NIKE such that no ppt adversarial machine A satisfies f, ae-R NIKE .</p><p>It is apparent that ae-P CDH implies ae-P NIKE by defining KGen(x) := g x and SHK(pk, sk ) := pk sk . Therefore, it is sufficient to prove that ae-P NIKE does not exist in the sparse GGM.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">ae-P NIKE Does Not Exist in the Sparse GGM/GBM</head><p>In this section, we establish the black-box separation between ae-P NIKE and the sparse GGM. Specifically, we construct an adversary that, while computationally unbounded, is query-efficient and capable of breaking the KRA-secure game with a noticeable probability. The corresponding proof for the sparse GBM proceeds analogously and is deferred to the full version of this paper. Theorem 3.1. Let G N,m be a generic group model such that mlog N &#8805; &#969;(log &#955;). Let &#928; GN,m = (KGen GN,m , SHK GN,m , AE GN,m ) be any non-interactive key exchange protocol with nice admissible encoding par ameterized by a constant and a polynomial poly AE . Then there exists an adversary A and two polynomials poly 1 and poly 2 such that:</p><p>-A makes poly 1 queries to G N,m ; -A breaks the KRA-secure game with advantage 1 poly 2 . Proof. To establish the proof, we present a formal description of the a dversary A, as illustrated in Fig. <ref type="figure">2</ref>. Here, we specify that any algorithm in &#928; GN,m makes at most q queries, where q is a polynomial. We then clarify some undefined notions: By (que 1 , res 1 ), . . . , (que q , res q ) query &#8592;-KGen GN,m (x), we mean that when running KGen GN,m (x), the algorithm makes queries (que 1 , . . . , que q ) to the oracle G N,m , and obtains (res 1 , . . . , res q )<ref type="foot">foot_10</ref> .</p><p>Trivial to note that the adversary A GN,m is query-efficient. We next prove that the adversary A GN,m can successfully guess the valid shared key with a noticeable probability. Let S Bob-L denote the set of all valid group encodings that appear when running the algorithms KGen GN,m (sk 2 ) and SHK GN,m (pk 1 , sk 2 ); these encodings are either the responses of labeling/addition queries or the valid inputs of the addition queries. Clearly, |S Bob-L | &#8804; 6q, since each algorithm makes at most q queries to G N,m . We then define:</p><p>Note that in each iteration, if the encoding pairs in SA are consistent with S Bob , the shared key computed in that iteration would be correct. Specifically, in such a context, there exists another instance of GGM that is consistent with both the simulation view of A and the real view of user Bob. The perfect correctness of ae-P NIKE guarantees that the shared key computed in this iteration must be equal to the true key computed by Bob. However, without knowledge of sk 2 , SA might be inconsistent with S Bob with high probability. In fact, there are three events:</p><p>-Event 1: There exist (x, str) &#8712; SA and (x , str ) &#8712; S Bob such that x = x but str = str . -Event 2: There exist (x, str) &#8712; SA and (x , str ) &#8712; S Bob suc h that x = x but str = str . -Event 3: For any (x, str) &#8712; SA and (x , str ) &#8712; S Bob , we have that i f x = x then str = str , and vice versa.</p><p>We immediately observe that Event 1 occurs at most 6q times, since the updating phase would eliminate at least one pair in S Bob with each occurrence. Next we show that, with a noticeable probability, Event 3 would deduce the valid shared key. According to the adversary, decipted in Fig. <ref type="figure">2</ref>, we note that the set SA &#8746; S que-res responds to the labeling queries properly. For the addition queries, say que = (str 1 , str 2 ), there are two cases:</p><p>-Case 1: SA &#8746; S que-res covers (x 1 , str 1 ), (x 2 , str 2 ), and ( x 1 + x 2 , str 3 ); -Case 2: either str 1 or str 2 is not collected in &#732; S A &#8746; S que-res .</p><p>For the former case, SA &#8746; S que-res responds to the addition query properly; for the latter case, SA &#8746; S que-res always responds with &#8869;, which means that if both str 1 and str 2 are valid group elements then the response is invalid. Therefore, the only bad case that prevents Event 3 from guessing a valid shared key is that the adversary A generates valid group elements str without making labeling queries (i.e., without knowing the corresponding discrete logarithm). Besides, we observe that, when Event 2 occurs, the adversary also generates valid group elements without making labeling queries. Hence, if the probability of such a bad case is small, then the adversary A GN,m can break the KRA-secure game and output the valid shared key with a good probability.</p><p>According to the description of the adversary A GN,m , we immediately observe that, A GN,m aborts if either pk 1 or pk 2 has no preimage with respect to AE GN,m . To analyze the probability of such a bad event , we define a tuple (G N,m , sk 1 , sk 2 ) is invertible if the following conditions are satisfied:</p><p>Due to the fact that AE GN,m is a -regular admissible encoding from Z p to the N valid public keys, there are at least p public keys with preimages under AE GN,m . Each such public key corresponds to a unique private key. Therefore, we have that</p><p>where the probability of over the sampling of the instance of the GGM and private keys.</p><p>Moreover, we observe that once the randomness of the adversary is fixed, then the algorithm A GN,m (pk 1 , pk 2 ) is deterministic. Let r be a nonce, we say the adversary performs good with respect to the tuple (G N,m , sk 1 , sk 2 , r), if the adversary A GN,m (pk 1 , pk 2 ), utilizing r as its source of randomness, is unable to generate a valid group element without knowing the discrete logarithm. Here, pk i represents the public key generated from the private key sk i .</p><p>Next, we introduce the concept of "good" for the tuple (G N,m , sk 1 , sk 2 ). More formally, we say a tuple (G N,m , sk 1 , sk 2 ) is good if</p><p>where pk 1 &#8592; KGen GN,m (sk 1 ) and pk 2 &#8592; KGen GN,m (sk 2 ), and the probability is over the internal randomness of A. Analogously, we say a tuple (G N,m , sk 1 , sk 2 ) is bad if</p><p>Pr[A performs good with respect to (G N,m , sk 1 , sk 2 , r)] &lt; 1 2 .</p><p>For clarity, we denote these three events in which the tuple (G N,m , sk 1 , sk 2 ) is invertible, good or bad as Invertible, Good and Bad, respectively. Due to the perfect correctness of &#928; GN,m , it is apparent that the probability A wins the KRAsecure game, conditioned on Good &#8743; Invertible, is &#8805; 1 2 . Thus, the probability of A wins can be bounded by</p><p>Therefore, it suffices to prove that Pr[Good|Invertible] is noticeable. To finalize the proof, we utilize the sparsity property of the GGM. As discussed in <ref type="bibr">[Zha22]</ref>, given access to a sparse GGM, any algorithm that receives only the security parameter as input cannot output a valid group element without knowledge of the corresponding discrete logarithm except with negligible probability 11 . For ease of exposition, we refer to this event as the algorithm outputs a new group element. More precisely, we say an algorithm E outputs a new group element str, if (1) str has been used as an input to some addition queries made by E, and (2) str has not appeared as the output of any previous queries (whether labeling or addition) made by E.</p><p>We note that, if Pr[Good|Invertible] is small, then, conditioned on Invertible, the adversary A is likely to generate a new group element with a g ood probability. Consequently, our proof strategy proceeds in two steps:</p><p>1. Construct an extractor E that takes only the s ecurity parameter as input; 2. Demonstrate that, if Pr[Good|Invertible] is small, then E outputs a new group element with a good probability.</p><p>We now present the formal description of the e xtractor E, as depicted in Fig. <ref type="figure">3</ref>. The extractor E takes only the security parameter as input, randomly selects two seeds (i.e., seed 1 and seed 2 ), and computes the corresponding public keys (i.e., pk 1 and pk 2 ) using admissible encodings. Following this, the extractor proceeds in a manner closely resembling the adversary in the KRA-security game, as shown in Fig. <ref type="figure">2</ref>. Specifically, the extractor proceeds the preprocessing phase and the iteration phase. During the simulation process of each iteration phase, if a new and valid group element appears, either in SA or as an input of some addition queries, then E outputs this element (highlighted in red). Observe that the randomness of E is composed of three components: seed 1 , seed 2 , and r, where r represents the randomness used during the iteration phase. Furthermore, once the GGM instance and the two seeds are fixed, the extractor E proceeds identically to the adversary A in the KRA-security game, continuing until E successfully outputs a new and valid group element.</p><p>Next, we analyze the probability of the extractor wins, which occurs when E outputs a new and valid group element. Let G N,m denote the GGM to which the extractor E has access. Let seed 1 and seed 2 denote the seeds sampled by the extractor. We define the seed pair, (seed 1 , seed 2 ), to be bad with respect to G N,m , if the tuple (G N,m , sk 1 , sk 2 ), deduced from (seed 1 , seed 2 ), is a Bad and Invertible tuple. More precisely, when we say a tuple (G N,m , sk 1 , sk 2 ) deduced from (seed 1 , seed 2 ) with respect to G N,m , we mean that KGen GN,m (sk 1 ) = AE GN,m (seed 1 ) and KGen GN, m (sk 2 ) = AE GN,m (seed 2 ).</p><p>For clarity, we refer to this event as BadSeed in the following analysis. According to the definition of Bad, we note that if E fortunately samples a bad seed pair, i.e., BadSeed occurs, then E wins with a good probability. Concretely,</p><p>Next, we analyze the probability of BadSeed with respect to any specific GGM instance. To facility the analysis, we introduce several concepts relevant to the generic group model. For any instance of the GGM, denoted as G N,m , we define Q GN,m as the set of all private keys such that for any private key sk &#8712; Q GN,m , the corresponding public ke y (denoted pk = KGen GN,m (sk)) has valid preimages under the admissible encodings. Moreover, due to that the admissible encodings is -regular, it is apparent that</p><p>We then categorize all instances of the GGM into two types: good GGM instances and bad GGM instances. Specifically, let G N,m be a GGM instance, and Q GN,m := {sk 1 , . . . , sk t }. Let (i, j) be a pair where i, j &#8712; [1, t] 12 . We classify G N,m as a bad instance if there are more than 1 2 2 &#8226;t 2 pairs such that each induces a Bad and Invertible tuple. More precisely, when we say a pair such as (i * , j * ) induces a Bad and Invertible tuple, we mean that the tuple (G N,m , sk i * , sk j * ) is bad and invertible. Analogously, we classify G N,m as a good instance if there are at most 1 2 2 &#8226;t 2 pairs, each of which induces a Bad and Invertible tuple. Therefore, for any good GGM instance, there are at least (1 -1 2 2 ) &#8226; t 2 tuples that are both Good and Invertible.</p><p>Furthermore, for any GGM instance G N,m , if a tuple (G N,m , sk 1 , sk 2 ) is invertible, then there are at least one seed pair, such as (seed 1 , seed 2 ), from which (G N,m , sk 1 , sk 2 ) can be deduced from. Consequently, for any fixed GGM instance, we have that, Num of bad and invertible tuples &#8804; Num of bad seed pairs. We are now prepared to establish the lower bound on the probability of the extractor E wins when accessing G N,m , conditioned on G N,m being a bad GGM instance. Concretely, let t be the size of Q GN,m , we have that,</p><p>Pr[E GN,m wins|G N,m is bad] &#8805; Pr[E GN,m wins|BadSeed] &#8226; Num of bad seed pairs Num of all seed pairs &#8805; 1 2 &#8226; Num of bad and invertible tuples Num of all seed pairs &#8805; 1 2 &#8226; 1 2 2 &#8226; t 2 p 2 &#8805; 1 4 4 . Next, we analyze the probability of E wins, where the probability is also considered o ver the uniform sampling of the GGM instance. Specifically, Pr[E wins] &#8805; Pr[E GN,m wins|G N,m is bad] &#8226; Num of bad GGM instances Num of all GGM instances &#8805; 1 4 4 &#8226; Num of bad GGM instances Num of all GGM instances . For clarity in exposition, we define TotalGGM as the set of all GGM instances, BadGGM as the subset of bad GGM instances, and GoodGGM as the subset of good GGM instances. Moreover, we define BadRatio := |BadGGM| |TotalGGM| ; GoodRatio := |GoodGGM| |TotalGGM| . Thus, the probability of the extractor E wins is bounded by Pr[ E wins] &#8805; 1 4 4 &#8226; BadRatio. According to the sparsity property of the GGM, we know that the probability of E outputs a new group element is negligible, indicating that BadRatio is negligible. T herefore, it suffices to prove that, if Pr[Good|Invertible] is not noticeable, then BadRatio is not negligible. Claim. If Pr[Good|Invertible] &#8804; 1 4 2 &#8226; 1 poly AE (&#955;) 2 , then BadRatio &#8805; 1 2 . We proceed our analysis by contraposition. Specifically, we prove that if BadRatio &lt; 1 2 , then Pr[Good|Invertible] &gt; 1 4 2 &#8226; 1 poly AE (&#955;) 2 . By definition, we have that Pr[Good|Invertible] = Pr[Good &#8743; Invertible] Pr[Invertible] = &#931; GN,m Num of Good and Invertible tuples w.r.t. G N,m &#931; GN,m Num of Invertible tuples w.r.t. G N,m , where the summation &#931; GN,m is taken over all GGM instances. Note that for any GGM instance G N,m , since the size of the private-key space is N , there are at most N 2 relevant tuples associated with G N,m . This implies that &#931; GN,m Num of Invertible tuples w.r.t. G N,m &#8804; |TotalGGM| &#8226; N 2 . Furthermore, by the definition of good GGM instance, for any good GGM instance G N,m , there are at least (1 -1 2 2 ) &#8226; t 2 tuples that are both Good and Invertible tuples associated with G N,m . Therefore, we know that &#931; GN,m Num of Good and Invertible tuples w.r.t. G N,m &#8805; |GoodGGM|&#8226;(1-1 2 2 )&#8226;t 2 , where t = |Q GN,m | &#8805; p . Thus, we hav e that Pr[Good|Invertible]</p><p>Therefore, we conclude that, if BadRatio &lt; 1 2 , then</p><p>Combining together, we establish the entire proof.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Impossibility of Dense Groups</head><p>In this section, we focus on cryptographic groups with dense group encodings, demonstrating that any such group that is CDH-secure, denoted as d -P CDH , cannot exist within the sparse GGM/GBM. Following a similar strategy as in Sect. 3, we introduce an intermediary primitive, i.e. non-interactive key exchange (NIKE) with dense public-key space, which is secure against key-recove ry attack (KRA-secure), denoted as d-P NIKE , and prove that:</p><p>d-P CDH implies d-P NIKE ; -d-P NIKE does not exist in the sparse GGM/GBM. 1. The set d-F NIKE consists of functions f that, on input of a security parameter, output the description of a non-interactive key exchange protocol. Specifically, we write (KGen, SHK) &#8592; f (1 &#955; ), where algorithms are associated with SK, PK, and K.</p><p>-SK, PK, and K are the private-key space, public-key space and shared-key space, respectively, satisfying that SK := Z N , where N is a &#955;-bit integer, and log |PK|log |SK| &#8804; &#920;(log &#955;); -The public-key generation function KGen : SK &#8594; PK is an injection, for generating a public key pk &#8712; PK from a randomly chosen private key sk &#8712; SK. -The shared-key generation function SHK : PK&#215;SK &#8594; K&#8746;{&#8869;} for generating a shared key shk &#8712; K&#8746;{ &#8869;}, where &#8869; indicates a failed computation.</p><p>Concretely, for randomly chosen sk $ &#8592; SK and sk $ &#8592; SK , compute pk &#8592; KGen(sk) and pk &#8592; KGen(sk ). We write shk &#8592; SHK(pk , sk) and shk &#8592; SHK(pk, sk ). The protocol is required t o achieve perfect correctness, meaning that:</p><p>Pr shk = &#8869; &#8744; shk = &#8869; &#8744; shk = shk = 0.</p><p>2. For a function f = (KGen, SHK) &#8712; d-F NIKE and a ppt (adversarial) machine A, we define f, d-R NIKE if A can br eak the security property of f against key-recovery attack (KRA).</p><p>We say d-P NIKE exists if there is a function f &#8712; d-F NIKE such that no ppt adversarial machine A satisfies f, d-R NIKE .</p><p>It is apparent that d-P CDH implies d-P NIKE by defining KGen(x) := g x and SHK(pk, sk ) := pk sk . Therefore, it is sufficient to prove that d-P NIKE does not exist in the sparse GGM/GBM. Due to space limit, we leave the proof in the full version of this paper.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Sparse GGM vs. EC-GGM</head><p>In this section, we analyze the relationship between the Elliptic Curve Generic Group Model (EC-GGM) and t he sparse GGM within the framework of indifferentiability.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1">EC-GGM Statistically Implies Sparse GGM</head><p>We describe how to build a sparse generic group from an EC-GGM equipped with an independent random permutation. Denote Orac as the tuple (ec-G N , Perm), where ec-G N = (ec-G label N , ec-G add N ) is an elliptic curve generic group model over a point set E, and Perm is a random permutation over {0, 1} m . Define &#916;m := m -( log p + 1), the construction &#928; Orac = (L Orac , A Orac ) is depicted in Fig. <ref type="figure">4</ref>. Theorem 5.1. Let G N,m be a sparse generic group model. The scheme &#928; Orac is indifferentiable from G N,m . More precisely, there exists a simulator S such that for all q-query differentiator D, we have</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Adv indif</head><p>&#928; Orac ,GN,m,S,D &#8804; 36q 2 N + 12q 2 + q 2 &#955; + q 2 &#969;(log &#955;) . The simulator makes at most &#955;q queries to G N,m .</p><p>Due to space limit, we leave the proof in the full version of this paper.  Our strategy fundamentally builds on the analytical framework develop ed by Zhandry and Zhang [ZZ23]. Due to space limit, we leave the proof in the full version of this paper.</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0"><p>Here, p denotes the prime modulus used in elliptic curve groups, where the curve is typically defined b y the equation y</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1"><p>= x</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2"><p>+ Ax + B mod p.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_3"><p>Since Sque-res is only a subset of H, it is possible that SA may be inconsistent with the random oracle H.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_4"><p>The conventional black-box separation framework establishes impossibility r esults unconditionally, whereas [JZW+24] demonstrates impossibility only under certain unjustified conditions.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_5"><p>The probability is taken over the sampling of the GGM instances and the internal randomness of the algorithm.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_6"><p>randomly sample r, r 1 , . . . , r n 6 , execute L H (r), L H (-r), L H (r 1 ), . . . , L H (r n ), and collect all the queries into S que-res ;6 Here n is a sufficiently large integer.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="7" xml:id="foot_7"><p>Without loss of generality, we assume that L G (&#8226;) only makes labeling queries.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="8" xml:id="foot_8"><p>GBM is symmetric indicating that m1 = m2, and we redenote by m S the encoding length of source group.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="9" xml:id="foot_9"><p>For any u &#8712; Zp there are at most two points on the curve with u as their x-coordinate, namely, (u, &#177;y) for some y. To simplify, point compression is applied, and we denote the point (u, y) by the pair(u, b), where b = 0 if y is even and b = 1 if y is odd.</p></note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="10" xml:id="foot_10"><p>As explained above, we assume that the algorithms KGen G N,m and AE G N,m only make labeling queries.</p></note>
		</body>
		</text>
</TEI>
