Search for: All records

Creators/Authors contains: "Pilanci, Mert"

« Prev Next »

Total Resources

32

Resource Type
Conference Paper

27

Conference Proceeding

0

Dataset

0

Journal Article

5

Workshop Report

0

Availability
Full Text / Resource Available

29

Citation Only

3

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Path regularization: A convexity and sparsity inducing regularization for parallel relu networks

Ergen, Tolga ; Pilanci, Mert ( December 2023 , Neural Information Processing Systems (NeurIPS))

Free, publicly-accessible full text available December 10, 2024
Fixing the NTK: From Neural Network Linearizations to Exact Convex Programs

Dwaraknath, Rajat Vadiraj ; Ergen, Tolga ; Pilanci, Mert ( December 2023 , Neural Information Processing Systems (NeurIPS))

Free, publicly-accessible full text available December 10, 2024
Sketching the Krylov subspace: faster computation of the entire ridge regularization path

https://doi.org/10.1007/s11227-023-05309-w

Wang, Yifei ; Pilanci, Mert ( January 2023 , The Journal of Supercomputing)

We propose a fast algorithm for computing the entire ridge regression regularization path in nearly linear time. Our method constructs a basis on which the solution of ridge regression can be computed instantly for any value of the regularization parameter. Consequently, linear models can be tuned via cross-validation or other risk estimation strategies with substantially better efciency. The algorithm is based on iteratively sketching the Krylov subspace with a binomial decomposition over the regularization path. We provide a convergence analysis with various sketching matrices and show that it improves the state-of-the-art computational complexity. We also provide a technique to adaptively estimate the sketching dimension. This algorithm works for both the over-determined and under-determined problems. We also provide an extension for matrix-valued ridge regression. The numerical results on real medium and large-scale ridge regression tasks illustrate the efectiveness of the proposed method compared to standard baselines which require super-linear computational time.
more » « less
Full Text Available
Optimal Sets and Solution Paths of ReLU Networks

Mishkin, Aaron ; Pilanci, Mert ( January 2023 , International Conference on Machine Learning)

We develop an analytical framework to characterize the set of optimal ReLU neural networks by reformulating the non-convex training problem as a convex program. We show that the global optima of the convex parameterization are given by a polyhedral set and then extend this characterization to the optimal set of the non-convex training objective. Since all stationary points of the ReLU training problem can be represented as optima of sub-sampled convex programs, our work provides a general expression for all critical points of the non-convex objective. We then leverage our results to provide an optimal pruning algorithm for computing minimal networks, establish conditions for the regularization path of ReLU networks to be continuous, and develop sensitivity results for minimal ReLU networks.
more » « less
Full Text Available
Optimal Shrinkage for Distributed Second-Order Optimization

Zhang, Fangzhao ; Pilanci, Mert ( January 2023 , International Conference on Machine Learning)

In this work, we address the problem of Hessian inversion bias in distributed second-order optimization algorithms. We introduce a novel shrinkage-based estimator for the resolvent of gram matrices that is asymptotically unbiased, and characterize its non-asymptotic convergence rate in the isotropic case. We apply this estimator to bias correction of Newton steps in distributed second-order optimization algorithms, as well as randomized sketching based methods. We examine the bias present in the naive averaging-based distributed Newton’s method using analytical expressions and contrast it with our proposed biasfree approach. Our approach leads to significant improvements in convergence rate compared to standard baselines and recent proposals, as shown through experiments on both real and synthetic datasets.
more » « less
Full Text Available
Adaptive and Oblivious Randomized Subspace Methods for High-Dimensional Optimization: Sharp Analysis and Lower Bounds

https://doi.org/10.1109/TIT.2022.3146206

Lacotte, Jonathan ; Pilanci, Mert ( May 2022 , IEEE Transactions on Information Theory)

Full Text Available
Orthonormal Sketches for Secure Coded Regression

https://doi.org/10.1109/ISIT50566.2022.9834356

Charalambides, Neophytos ; Mahdavifar, Hessam ; Pilanci, Mert ; Hero, Alfred O. ( June 2022 , 2022 IEEE International Symposium on Information Theory (ISIT))

Full Text Available
Coil sketching for computationally efficient MR iterative reconstruction

https://doi.org/10.1002/mrm.29883

Oscanoa, Julio A. ; Ong, Frank ; Iyer, Siddharth S. ; Li, Zhitao ; Sandino, Christopher M. ; Ozturkler, Batu ; Ennis, Daniel B. ; Pilanci, Mert ; Vasanawala, Shreyas S. ( October 2023 , Magnetic Resonance in Medicine)

Abstract Purpose
Parallel imaging and compressed sensing reconstructions of large MRI datasets often have a prohibitive computational cost that bottlenecks clinical deployment, especially for three‐dimensional (3D) non‐Cartesian acquisitions. One common approach is to reduce the number of coil channels actively used during reconstruction as in coil compression. While effective for Cartesian imaging, coil compression inherently loses signal energy, producing shading artifacts that compromise image quality for 3D non‐Cartesian imaging. We propose coil sketching, a general and versatile method for computationally‐efficient iterative MR image reconstruction.
Theory and Methods
We based our method on randomized sketching algorithms, a type of large‐scale optimization algorithms well established in the fields of machine learning and big data analysis. We adapt the sketching theory to the MRI reconstruction problem via a structured sketching matrix that, similar to coil compression, considers high‐energy virtual coils obtained from principal component analysis. But, unlike coil compression, it also considers random linear combinations of the remaining low‐energy coils, effectively leveraging information from all coils.
Results
First, we performed ablation experiments to validate the sketching matrix design on both Cartesian and non‐Cartesian datasets. The resulting design yielded both improved computatioanal efficiency and preserved signal‐to‐noise ratio (SNR) as measured by the inverse g‐factor. Then, we verified the efficacy of our approach on high‐dimensional non‐Cartesian 3D cones datasets, where coil sketching yielded up to three‐fold faster reconstructions with equivalent image quality.
Conclusion
Coil sketching is a general and versatile reconstruction framework for computationally fast and memory‐efficient reconstruction.

more » « less
Neural Fisher Discriminant Analysis: Optimal Neural Network Embeddings in Polynomial Time

Bartan, Burak ; Pilanci, Mert ( April 2021 , Proceedings of Machine Learning Research)

Fisher’s Linear Discriminant Analysis (FLDA) is a statistical analysis method that linearly embeds data points to a lower dimensional space to maximize a discrimination criterion such that the variance between classes is maximized while the variance within classes is minimized. We introduce a natural extension of FLDA that employs neural networks, called Neural Fisher Discriminant Analysis (NFDA). This method finds the optimal two-layer neural network that embeds data points to optimize the same discrimination criterion. We use tools from convex optimization to transform the optimal neural network embedding problem into a convex problem. The resulting problem is easy to interpret and solve to global optimality. We evaluate the method’s performance on synthetic and real datasets.
more » « less
Full Text Available
Training Quantized Neural Networks to Global Optimality via Semidefinite Programming

Bartan, Burak ; Pilanci, Mert ( April 2021 , Proceedings of Machine Learning Research)

Neural networks (NNs) have been extremely successful across many tasks in machine learning. Quantization of NN weights has become an important topic due to its impact on their energy efficiency, inference time and deployment on hardware. Although post-training quantization is well-studied, training optimal quantized NNs involves combinatorial non-convex optimization problems which appear intractable. In this work, we introduce a convex optimization strategy to train quantized NNs with polynomial activations. Our method leverages hidden convexity in two-layer neural networks from the recent literature, semidefinite lifting, and Grothendieck’s identity. Surprisingly, we show that certain quantized NN problems can be solved to global optimality provably in polynomial time in all relevant parameters via tight semidefinite relaxations. We present numerical examples to illustrate the effectiveness of our method.
more » « less
Full Text Available

« Prev Next »