- Home
- Search Results
- Page 1 of 1
Search for: All records
- 
                                    Total Resources3
- Resource Type
- 
                                    
                                    
                                    
                                    0003000000000000
- More
- Availability
- 
                                    
                                    12
- Author / Contributor
- Filter by Author / Creator
- 
                                    
                                        - 
                                                    
                                                        
                                                            
                                                            Chaudhuri, Swarat (3)
- 
                                                    
                                                        
                                                            
                                                            Ding, Michelle (2)
- 
                                                    
                                                        
                                                            
                                                            Jennings, John (2)
- 
                                                    
                                                        
                                                            
                                                            Jennings, Michael (2)
- 
                                                    
                                                        
                                                            
                                                            Lee, Jasper (2)
- 
                                                    
                                                        
                                                            
                                                            Thakur, Amitayush (2)
- 
                                                    
                                                        
                                                            
                                                            Tsoukalas, George (2)
- 
                                                    
                                                        
                                                            
                                                            Xin, Jimmy (2)
- 
                                                    
                                                        
                                                            
                                                            Ding, Zhimin (1)
- 
                                                    
                                                        
                                                            
                                                            Hu, Erdong (1)
- 
                                                    
                                                        
                                                            
                                                            Jermaine, Christopher (1)
- 
                                                    
                                                        
                                                            
                                                            Nie, Lunyiu (1)
- 
                                                    
                                                        
                                                            
                                                            #Tyler Phillips, Kenneth E. (0)
- 
                                                    
                                                        
                                                            
                                                            #Willis, Ciara (0)
- 
                                                    
                                                        
                                                            
                                                            & Abreu-Ramos, E. D. (0)
- 
                                                    
                                                        
                                                            
                                                            & Abramson, C. I. (0)
- 
                                                    
                                                        
                                                            
                                                            & Abreu-Ramos, E. D. (0)
- 
                                                    
                                                        
                                                            
                                                            & Adams, S.G. (0)
- 
                                                    
                                                        
                                                            
                                                            & Ahmed, K. (0)
- 
                                                    
                                                        
                                                            
                                                            & Ahmed, Khadija. (0)
 
- 
                                                    
                                                        
                                                            
                                                            
- Filter by Editor
- 
                                    
                                        - 
                                                    
                                                        
                                                            
                                                            & Spizer, S. M. (0)
- 
                                                    
                                                        
                                                            
                                                            & . Spizer, S. (0)
- 
                                                    
                                                        
                                                            
                                                            & Ahn, J. (0)
- 
                                                    
                                                        
                                                            
                                                            & Bateiha, S. (0)
- 
                                                    
                                                        
                                                            
                                                            & Bosch, N. (0)
- 
                                                    
                                                        
                                                            
                                                            & Brennan K. (0)
- 
                                                    
                                                        
                                                            
                                                            & Brennan, K. (0)
- 
                                                    
                                                        
                                                            
                                                            & Chen, B. (0)
- 
                                                    
                                                        
                                                            
                                                            & Chen, Bodong (0)
- 
                                                    
                                                        
                                                            
                                                            & Drown, S. (0)
- 
                                                    
                                                        
                                                            
                                                            & Ferretti, F. (0)
- 
                                                    
                                                        
                                                            
                                                            & Higgins, A. (0)
- 
                                                    
                                                        
                                                            
                                                            & J. Peters (0)
- 
                                                    
                                                        
                                                            
                                                            & Kali, Y. (0)
- 
                                                    
                                                        
                                                            
                                                            & Ruiz-Arias, P.M. (0)
- 
                                                    
                                                        
                                                            
                                                            & S. Spitzer (0)
- 
                                                    
                                                        
                                                            
                                                            & Sahin. I. (0)
- 
                                                    
                                                        
                                                            
                                                            & Spitzer, S. (0)
- 
                                                    
                                                        
                                                            
                                                            & Spitzer, S.M. (0)
- 
                                                    
                                                        
                                                            
                                                            (submitted - in Review for IEEE ICASSP-2024) (0)
 
- 
                                                    
                                                        
                                                            
                                                            
- 
                                    Have feedback or suggestions for a way to improve these results?
 !
                                    
                                        
                                            Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
                                            Some full text articles may not yet be available without a charge during the embargo (administrative interval).
                                        
                                        
                                        
                                            
                                                
                                             What is a DOI Number?
                                        
                                    
                                
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
- 
            We present PUTNAMBENCH, a new multilingual benchmark for evaluating the ability of neural theorem-provers to solve competition mathematics problems. PUTNAMBENCH consists of 1697 hand-constructed formalizations of 640 theorems sourced from the William Lowell Putnam Mathematical Competition, the premier undergraduate-level mathematics competition in North America. All the theorems have formalizations in Lean 4 and Isabelle; a substantial subset also has Coq formalizations. Proving the theorems requires significant problem-solving ability and proficiency in a broad range of topics taught in undergraduate mathematics courses. We use PUTNAMBENCH to evaluate several established neural and symbolic theorem-provers. These approaches can only solve a handful of the PUTNAMBENCH problems, establishing the benchmark as a difficult open challenge for research on neural theorem-proving. PUTNAMBENCH is available at https://github.com/trishullab/PutnamBench.more » « lessFree, publicly-accessible full text available December 7, 2025
- 
            Tsoukalas, George; Lee, Jasper; Jennings, John; Xin, Jimmy; Ding, Michelle; Jennings, Michael; Thakur, Amitayush; Chaudhuri, Swarat (, Neural Information Processing Systems)We present PutnamBench, a new multi-language benchmark for evaluating the ability of neural theorem-provers to solve competition mathematics problems. PutnamBench consists of 1692 hand-constructed formalizations of 640 theorems sourced from the William Lowell Putnam Mathematical Competition, the premier undergraduate-level mathematics competition in North America. All the problems have formalizations in Lean 4 and Isabelle; a substantial subset also has Coq formalizations. PutnamBench requires significant problem-solving ability and proficiency in a broad range of topics taught in undergraduate mathematics courses. We use PutnamBench to evaluate several established neural and symbolic theorem-provers. These approaches can only solve a handful of the PutnamBench problems, establishing the benchmark as a difficult open challenge for research on neural theorem-proving. PutnamBench is available at https://github.com/trishullab/PutnamBench.more » « lessFree, publicly-accessible full text available December 1, 2025
- 
            Nie, Lunyiu; Ding, Zhimin; Hu, Erdong; Jermaine, Christopher; Chaudhuri, Swarat (, International Conference on Machine Learning (ICML))Large Language Models (LLMs) have a natural role in answering complex queries about data streams, but the high computational cost of LLM inference makes them infeasible in many such tasks. We propose online cascade learning as an approach to address this challenge. The objective here is to learn a “cascade” of models, starting with lower-capacity models (such as logistic regression) and ending with a powerful LLM, along with a deferral policy that determines the model to be used on a given input. We formulate the task of learning cascades online as an imitation-learning problem, where smaller models are updated over time imitating the LLM expert demonstrations, and give a no-regret algorithm for the problem. Experimental results across four benchmarks show that our method parallels LLMs in accuracy while cutting down inference costs by as much as 90% with strong robustness against input distribution shifts, underscoring its efficacy and adaptability in stream processing.more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
 
                                     Full Text Available
                                                Full Text Available