- Home
- Search Results
- Page 1 of 1
Search for: All records
-
Total Resources4
- Resource Type
-
02000020000
- More
- Availability
-
22
- Author / Contributor
- Filter by Author / Creator
-
-
Wang, Yichen (4)
-
He, Tianxing (2)
-
Tsvetkov, Yulia (2)
-
Bohan_Hou, Abe (1)
-
Cai, T. Tony (1)
-
Chuang, Yung-Sung (1)
-
Feng, Shangbin (1)
-
Harchaoui, Zaid (1)
-
He, Niao (1)
-
Hou, Abe (1)
-
Khashabi, Daniel (1)
-
Liu, Xiaoming (1)
-
Pu, Xiao (1)
-
Shen, Chao (1)
-
Shen, Lingfeng (1)
-
Song, Le (1)
-
Van_Durme, Benjamin (1)
-
Wang, Hongwei (1)
-
Zhang, Jingyu (1)
-
Zhang, Linjun (1)
-
- Filter by Editor
-
-
& Spizer, S. M. (0)
-
& . Spizer, S. (0)
-
& Ahn, J. (0)
-
& Bateiha, S. (0)
-
& Bosch, N. (0)
-
& Brennan K. (0)
-
& Brennan, K. (0)
-
& Chen, B. (0)
-
& Chen, Bodong (0)
-
& Drown, S. (0)
-
& Ferretti, F. (0)
-
& Higgins, A. (0)
-
& J. Peters (0)
-
& Kali, Y. (0)
-
& Ruiz-Arias, P.M. (0)
-
& S. Spitzer (0)
-
& Sahin. I. (0)
-
& Spitzer, S. (0)
-
& Spitzer, S.M. (0)
-
(submitted - in Review for IEEE ICASSP-2024) (0)
-
-
Have feedback or suggestions for a way to improve these results?
!
Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Free, publicly-accessible full text available July 1, 2025
-
Hou, Abe ; Zhang, Jingyu ; He, Tianxing ; Wang, Yichen ; Chuang, Yung-Sung ; Wang, Hongwei ; Shen, Lingfeng ; Van_Durme, Benjamin ; Khashabi, Daniel ; Tsvetkov, Yulia ( , NAACL)Existing watermarked generation algorithms employ token-level designs and therefore, are vulnerable to paraphrase attacks. To address this issue, we introduce watermarking on the semantic representation of sentences. We propose SemStamp, a robust sentence-level semantic watermarking algorithm that uses locality-sensitive hashing (LSH) to partition the semantic space of sentences. The algorithm encodes and LSH-hashes a candidate sentence generated by a language model, and conducts rejection sampling until the sampled sentence falls in watermarked partitions in the semantic embedding space. To test the paraphrastic robustness of watermarking algorithms, we propose a {``}bigram paraphrase{''} attack that produces paraphrases with small bigram overlap with the original sentence. This attack is shown to be effective against existing token-level watermark algorithms, while posing only minor degradations to SemStamp. Experimental results show that our novel semantic watermark algorithm is not only more robust than the previous state-of-the-art method on various paraphrasers and domains, but also better at preserving the quality of generation.more » « lessFree, publicly-accessible full text available June 28, 2025
-
The cost of privacy: Optimal rates of convergence for parameter estimation with differential privacyCai, T. Tony ; Wang, Yichen ; Zhang, Linjun ( , The Annals of Statistics)
-
He, Niao ; Harchaoui, Zaid ; Wang, Yichen ; Song, Le ( , Applied Mathematics & Optimization)