EE 609: Convex
Optimization in SP/COM
- Instructors:
Ketan Rajawat
- Prerequisites:
Linear Algebra, Probability
- Objective:
Convex
optimization has recently been applied to a wide variety of problems in
EE,
especially in signal processing, communications, and networks. The aim
of this
course is to train the students in application and analysis of convex
optimization problems in signal processing and wireless communications.
At the
end of this course, the students are expected to:
- Know
about the applications of convex
optimization in signal processing, wireless communications, and
networking
research.
- Be
able to recognize convex optimization
problems arising in these areas.
- Be
able to recognize ‘hidden’ convexity
in many seemingly non-convex problems; formulate them as convex
problems.
- Be
able to develop low-complexity,
approximate solutions for difficult non-convex problems.
- References:
- Stephen Boyd and Lieven Vandenberghe, Convex
Optimization, Cambridge University Press. [Online]. http://www.stanford.edu/~boyd/cvxbook/
- Convex Optimization in Signal Processing and
Communications, D. P. Palomar, Y. C. Eldar. Cambridge Press, 2010.
- IEEE
Signal Processing Magazine- Special Issue on Advances in Convex
Optimization, Vol. 27, No. 3, May 2010.
- Dimitri P. Bertsekas, Convex Analysis and Optimization,
Athena-Scientific, 2003.
- Time and place: Lectures will be made available online, Discussion every Friday at 11am on zoom
- This course will cover (approximately)
- Background on linear algebra
- Convex sets, functions, and problems
- Examples of convex problems: LP, QCQP, SOCP
- Duality, KKT conditions
- Geometric programming and applications
- Linear and quadratic classification
- Network optimization
- Sparse regression, Lasso, ridge regression and
applications in image processing
- Robust least squares and applications in signal processing
- Support vector machines and applications in machine
learning
- Semidefinite programming and applications in experiment
design
- Semidefinite relaxation and applications in MIMO
detection, integer programming
- Low rank matrix completion and applications in
recommendor systems
- Multidimensional scaling and applications in sensor
localization
- Numerical linear algebra, basics of interior point methods
- Tentative list of topics for Term Paper (only
sample papers are provided, others can also be chosen, specify:
theory/mixed/application mode)
- Low-Rank Matrix Factorization:
- Chi, Yuejie, Yue M. Lu, and Yuxin Chen. "Nonconvex optimization meets low-rank matrix factorization: An overview." IEEE Transactions on Signal Processing 67.20 (2019): 5239-5269.
- Chen, Yudong, and Yuejie Chi. "Harnessing structures in big data via
guaranteed low-rank matrix estimation: Recent theory and fast algorithms
via convex and nonconvex optimization." IEEE Signal Processing Magazine 35.4 (2018): 14-31.
- Non-convex optimization for Machine Learning (several topics):
- Jain, Prateek, and Purushottam Kar. "Non-convex optimization for machine learning." arXiv preprint arXiv:1712.07897 (2017).
- Relevant slides: https://www.stat.cmu.edu/~ryantibs/convexopt-F16/lectures/nonconvex.pdf
- Recent Theoretical Advances in Non-Convex Optimization
- Danilova, Marina, et al. "Recent theoretical advances in non-convex optimization." arXiv preprint arXiv:2012.06188 (2020).
- Zhang, Yuqian, Qing Qu, and John Wright. "From symmetry to geometry: Tractable nonconvex problems." arXiv preprint arXiv:2007.06753 (2020).
- Blind deconvolution, demixing, phase retrieval etc:
- S.
Ling and T. Strohmer, “Blind deconvolution meets blind demixing:
Algorithms and performance bounds,” IEEE Trans. Inf. Theory, vol. 63,
no. 7, pp. 4497–4520, 2017.
- A.
Ahmed, A. Aghasi, P. Hand, “Simultaneous phase retrieval and
blind deconvolution via convex programming,” arXiv preprint
arXiv:1904.12680, 2019.
- C.
Ma, K. Wang, Y. Chi, and Y. Chen, “Implicit regularization in nonconvex
statistical estimation: Gradient descent converges linearly for phase
retrieval, matrix completion and blind deconvolution,” arXiv preprint
arXiv:1711.10467, 2017.
- J.
Dong and Y. Shi, “Nonconvex demixing from bilinear measurements,” IEEE
Trans. Signal Process., vol. 66, pp. 5152–5166, Oct. 2018.
- D.
Amelunxen, M. Lotz, M. B. McCoy, and J. A. Tropp, “Living on the edge:
phase transitions in convex programs with random data,” Inf. Inference,
vol. 3, pp. 224–294, Jun. 2014.
- Difference-of-convex programming:
- J.-y. Gotoh, A. Takeda, and K. Tono, “DC formulations and algorithms for sparse optimization problems,” Math. Program., vol. 169, no. 1, pp. 141–176, 2018.
- Riemannian/geometric optimization (generalizes geometric programming):
- Introductory slides: https://www.math.fsu.edu/~whuang2/pdf/slides_2016-GMDL.pdf
- Papers by Suvrat Sra from MIT, see relevant talk at: https://www.youtube.com/watch?v=-1BK70HM6ME
- Blog post by Andreas Bloch: https://andbloch.github.io/Stochastic-Gradient-Descent-on-Riemannian-Manifolds
- H Zhang, S J. Reddi, S. Sra, Riemannian SVRG: Fast Stochastic Optimization on Riemannian Manifolds
- Online
learning and online convex optimization
- Shai
Shalev-Shwartz, Foundations and Trends in ML, 4(2), 2011
- Introduction
to Online optimization
- Elad
Hazan, Foundations and Trends in Optimization, 2(3-4), 2015
- Proximal
Algorithms
- Neal
Parikh and Stephen Boyd (2014), "Proximal Algorithms", Foundations and
Trends in Optimization: Vol. 1: No. 3, pp 127-239
- Multi-period
investment
- Stephen
Boyd, Mark T. Mueller, Brendan O’Donoghue and Yang Wang (2014),
"Performance Bounds and Suboptimal Policies for Multi–Period
Investment", Foundations and Trends in Optimization: Vol. 1: No.
1
- Stephen
Boyd, Enzo Busseti, Steve Diamond, Ronald N. Kahn, Kwangmoo Koh, Peter
Nystrup and Jan Speth (2017), "Multi-Period Trading via Convex
Optimization", Foundations and Trends in Optimization: Vol. 3: No.
1
- Submodularity
and convexity
- Francis
Bach (2013), "Learning with Submodular Functions: A Convex Optimization
Perspective", Foundations and Trends in Machine Learning: Vol. 6: No.
2-3.
- Learning
over Networks
- Ali
H. Sayed (2014), "Adaptation, Learning, and Optimization over
Networks", Foundations and Trends in Machine Learning: Vol. 7: No.
4-5, pp 311-801.
- Approximate
dynamic programming in transportation/logistics
- W.
B. Powell, H. Simao, B. Bouzaiene-Ayari, “Approximate Dynamic
Programming in Transportation and Logistics: A Unified Framework,”
European J. on Transportation and Logistics, Vol. 1, No. 3, pp. 237-284
(2012). DOI 10.1007/s13676-012-0015-8.
- Sequential
stochastic optimization, dynamic programming
- http://castlelab.princeton.edu/html/Papers/Powell-UnifiedFrameworkStochasticOptimization_July222017.pdf
- Clearing
the Jungle of Stochastic Optimization, Informs Tutorials in Operations
Research: Bridging Data and Decisions, pp. 109-137, November, 2014
- Subgradient
methods (older papers, find newer ones):
- S.
Boyd and A. Mutapcic, Subgradient Methods , Notes for EE364b,
Stanford University, 2006. Available online.
- N.
Shor, Minimization Methods for
Non-Differentiable Functions , Springer Series in
Computational Mathematics, Springer, 1985.
- D.
Bertsekas, Nonlinear Programming. Athena Scientific, 1999.
- Robust
Optimization
- Bertsimas,
Dimitris, David B. Brown, and Constantine Caramanis. "Theory and
applications of robust optimization." SIAM review
53.3 (2011): 464-501.
- Xu,
Huan, Constantine Caramanis, and Sujay Sanghavi. "Robust PCA via
outlier pursuit." Advances in Neural Information Processing
Systems. 2010.
- Coordinate
ascent/descent
- Yu.
Nesterov, Efficiency of coordinate descent methods on huge-scale
optimization problems, SIAM Journal on Optimization, 2012
- P.
Richt ́arik and M. Takac, Iteration complexity of randomized
block-coordinate descent methods for minimizing a composite function,
Mathematical Programming, 2014
- P.
Tseng and S. Yun, A coordinate gradient descent method for nonsmooth
separable minimization, Mathematical Programming, 2009
- Large-scale
optimization
- Feng
Niu, Benjamin Recht, Christopher Re, and Stephen Wright. Hogwild: A
lock-free approach to parallelizing stochastic gradient descent. In
Advances in neural information processing systems, pages 693–701, 2011
- Gemulla,
Rainer, et al. "Large-scale matrix factorization with distributed
stochastic gradient descent." Proceedings of the 17th ACM
SIGKDD international conference on Knowledge discovery and data mining.
ACM, 2011.
- Bottou,
Leon. "Large-scale machine learning with stochastic gradient descent." Proceedings
of COMPSTAT'2010. Physica-Verlag HD, 2010. 177-186.
- Semi-stochastic
methods:
- Konecny,
Jakub, and Peter Richtarik. "Semi-stochastic gradient descent methods."
arXiv preprint arXiv:1312.1666 (2013).
- Konecny,
Jakub, et al. "Mini-batch semi-stochastic gradient descent in the
proximal setting." IEEE Journal of Selected Topics in Signal
Processing 10.2 (2016): 242-255.
- Konecny,
Jakub, Zheng Qu, and Peter Richtarik. "Semi-stochastic coordinate
descent." Optimization Methods and Software 32.5
(2017): 993-1005.
- Konecny,
Jakub, Zheng Qu, and Peter Richtarik. "S2cd: Semi-stochastic coordinate
descent." NIPS Optimization in Machine Learning workshop.
2014.
- Stochastic
and online optimization
- A.
Nemirovski, A. Juditsky, G. Lan and A. Shapiro,Robust stochastic
approximation approach to stochastic programming, SIAM Journal on
Optimization (2009)
- Yu.
Nesterov, Primal-dual subgradient methods for convex problems,
Mathematical Programming (2009)
- L.
Xiao, Dual averaging methods for regularized stochastic learning and
online optimization, Journal of Machine Learning Research (2010)
- N.
Le Roux, M. Schmidt and F. Bach, A stochastic gradient method with an
exponential convergence rate for strongly convex optimization with
finite training sets, NIPS (2012)
- Variance
reduced stochastic gradient descent
- R.
Johnson and T. Zhang, Accelerating stochastic gradient descent using
predictive variance reduction NIPS (2013)
- L.
Xiao and T. Zhang, A proximal stochastic gradient method with
progressive variance reduction
- Defazio,
Aaron, Francis Bach, and Simon Lacoste-Julien. "SAGA: A fast
incremental gradient method with support for non-strongly convex
composite objectives." Advances in neural information
processing systems. 2014.
- Schmidt,
Mark, Nicolas Le Roux, and Francis Bach. "Minimizing finite sums with
the stochastic average gradient." Mathematical Programming
162.1-2 (2017): 83-112.
- ADMM
variants
- D.
Goldfarb, S. Ma, K. Scheinberg, Fast alternating linearization methods
for minimizing the sum of two convex functions, (2010)
- T.
Goldstein and S. Osher, The split Bregman method for L1-regularized
problems, SIAM J. Imag. Sciences (2009)
- Subgradient
methods
- Nedic,
Angelia, and Asuman Ozdaglar. "Distributed subgradient methods for
multi-agent optimization." IEEE Transactions on Automatic Control 54.1
(2009): 48-61.
- Ram,
S. Sundhar, Angelia Nedić, and Venugopal V. Veeravalli. "Distributed
stochastic subgradient projection algorithms for convex optimization." Journal
of optimization theory and applications 147.3 (2010): 516-545.
- Tsitsiklis,
John, Dimitri Bertsekas, and Michael Athans. "Distributed asynchronous
deterministic and stochastic gradient optimization algorithms." IEEE
transactions on automatic control 31.9 (1986): 803-812.
- Smoothing
- Yu.
Nesterov, Smooth minimization of non-smooth functions, Mathematical
Programming (2005).
- Yu.
Nesterov, Excessive gap technique in nonsmooth convex minimization,
SIAM Journal on Optimization (2005)
- Majorization
Minimization
- Sun,
Ying, Prabhu Babu, and Daniel P. Palomar. "Majorization-minimization
algorithms in signal processing, communications, and machine learning."
IEEE Transactions on Signal Processing 65.3
(2017): 794-816.
- Figueiredo,
Mario AT, Jose M. Bioucas-Dias, and Robert D. Nowak.
"Majorization–minimization algorithms for wavelet-based image
restoration." IEEE Transactions on Image processing
16.12 (2007): 2980-2991.
- Mairal,
Julien. "Incremental majorization-minimization optimization with
application to large-scale machine learning." SIAM Journal on
Optimization 25.2 (2015): 829-855.
- Convex
optimization in Finance
- Yiyong
Feng and Daniel P. Palomar, A
Signal Processing Perspective on Financial Engineering, Foundations
and Trends in Signal Processing, Now Publishers, vol. 9, no. 1-2, 2016.
- Blind
deconvolution
- Ahmed,
Ali, Benjamin Recht, and Justin Romberg. "Blind deconvolution using
convex programming." IEEE Transactions on Information Theory
60.3 (2014): 1711-1732.
- Chan,
Tsung-Han, et al. "A convex analysis framework for blind separation of
non-negative sources." IEEE Transactions on Signal Processing
56.10 (2008): 5120-5134.
- Gillis,
Nicolas, and Stephen A. Vavasis. "Fast and robust recursive
algorithmsfor separable nonnegative matrix factorization." IEEE
transactions on pattern analysis and machine intelligence
36.4 (2014): 698-714.
- Network
utility maximization (Palomar Chiang)
- Palomar,
Daniel Perez, and Mung Chiang. "A tutorial on decomposition methods for
network utility maximization." IEEE Journal on Selected Areas
in Communications 24.8 (2006):
1439-1451.
- Palomar,
Daniel P., and Mung Chiang. "Alternative distributed algorithms for
network utility maximization: Framework and applications." IEEE
Transactions on Automatic Control 52.12 (2007): 2254-2269.
- Segmentation
and multiview 3D reconstruction in image processing
- Daniel
Cremers, Thomas Pock, Kalin Kolev, and Antonin Chambolle, “Convex
Relaxation Techniques for Segmentation, Stereo and Multiview
Reconstruction” In Markov Random Fields for Vision and Image
Processing. MIT Press, 2011.
- Antonin
Chambolle, Daniel Cremers, and Thomas Pock,“A convex approach to
minimal partitions” SIAM Journal on Imaging Sciences. 5(4):1113–1158,
2012
- Kalin
Kolev, Maria Klodt, Thomas Brox, and Daniel Cremers,“Continuous global
optimization in multiview 3d reconstruction,” International Journal of
Computer Vision. 84(1):80–96, 2009.
- Game
theoretic communication system design
- Gesualdo
Scutari, Daniel P. Palomar, and Sergio Barbarossa, “Competitive Design
of Multiuser MIMO Systems based on Game Theory: A Unified View,” IEEE
JSAC: Special Issue on Game Theory , vol. 25, no. 7, Sept. 2008.
- Gesualdo
Scutari, Daniel P. Palomar, and Sergio Barbarossa, “Optimal Linear
Precoding Strategies for Wideband Noncooperative Systems Based on Game
Theory,” IEEE Trans. on Signal Processing , vol. 56, no. 3,
March 2008.
- Gesualdo
Scutari, Daniel P. Palomar, and Sergio Barbarossa , “Asynchronous
Iterative Water-Filling for Gaussian Frequency-Selective Interference
Channels,” IEEE Trans. on Information Theory, vol. 54, no. 7,
July 2008.
- Gesualdo
Scutari, Daniel P. Palomar, and Sergio Barbarossa, “Cognitive MIMO
Radio: A Competitive Optimality Design Based on Subspace
Projections,” IEEE Signal Processing Magazine, Nov. 2008.
- Signomial
Programming:
- Mung
Chiang, “Geometric programming
for communication systems,” Foundations and Trends
in Communications and Information Theory , Now Publishers,
vol. 2, no. 1-2, Aug. 2006.
- Mung
Chiang, Chee Wei Tan,
Daniel P. Palomar, Daniel
O’Neill, and David Julian, “Power Control
by Geometric Programming,” IEEE Trans. on Wireless
Communications, vol. 6, no. 7, pp.
2640-2651, July 2007
- Index
tracking in finance
- Benidis,
K., Feng, Y., and Palomar, D. P. (2017). Sparse Portfolios for
High-Dimensional Financial Index Tracking. IEEE Transactions on Signal
Processing
- Xu,
F., Xu, Z., and Xue, H., (2015). Sparse index tracking based on L1/L2
model and algorithm. arXiv preprint
- Jansen,
R., and Van Dijk, R. (2002). Optimal benchmark tracking with small
portfolios The Journal of Portfolio Management vol. 28, no. 2, pp.
33–39.
- Beasley,
J. E., Meade, N., and Chang, T.-J. (2003). An evolutionary heuristic
for the index tracking problem. European Journal of Operational
Research, vol. 148, no. 3, pp. 621–643.
- Scozzari,
A., Tardella, F., Paterlini, S., and, Krink, T. (2013). Exact and
heuristic approaches for the index tracking problem with UCITS
constraints. Annals of Operations Research, vol. 205, no. 1, pp.
235–250.
- Multiobjective
optimization and pareto optimality
- Marler,
R. Timothy, and Jasbir S. Arora. "Survey of multi-objective
optimization methods for engineering." Structural and multidisciplinary
optimization 26.6 (2004): 369-395.
- Desideri,
Jean-Antoine. "Multiple-gradient descent algorithm (MGDA) for
multiobjective optimization." Comptes Rendus Mathematique 350.5-6
(2012): 313-318.
- Fliege,
Jorg, and Benar Fux Svaiter. "Steepest descent methods for
multicriteria optimization." Mathematical Methods of
Operations Research 51.3 (2000): 479-494.
- Harada,
Ken, Jun Sakuma, and Shigenobu Kobayashi. "Local search for
multiobjective function optimization: pareto descent method."
Proceedings of the 8th annual conference on Genetic and evolutionary
computation. ACM, 2006.
- Fliege,
Joerg, LM Grana Drummond, and Benar Fux Svaiter. "Newton's method for
multiobjective optimization." SIAM Journal on Optimization 20.2 (2009):
602-626.
- Saddle
point methods for convex optimization
- Zhu,
Minghui, and Sonia Martinez. "On distributed convex optimization under
inequality and equality constraints." IEEE Transactions on Automatic
Control 57.1 (2012): 151-164.
- Chang,
Tsung-Hui, Angelia Nedić, and Anna Scaglione. "Distributed constrained
optimization by consensus-based primal-dual perturbation method." IEEE
Transactions on Automatic Control 59.6 (2014): 1524-1538.
- Integer
programming
- Park,
Jaehyun, and Stephen Boyd. "A semidefinite programming method for
integer convex quadratic minimization." Optimization Letters
(2017): 1-20.
- Marchand,
Hugues; Martin, Alexander; Weismantel, Robert; Wolsey, Laurence (2002).
"Cutting planes in integer and mixed integer programming" (PDF).
Discrete Applied Mathematics. 123: 387–446.
- Castro, J.;
Nasini, S.; Saldanha-da-Gama, F. (2017). "A
cutting-plane approach for large-scale capacitated multi-period
facility location using a specialized interior-point method"
- Bader, David
A.; Hart, William E.; Phillips, Cynthia A. (2004). "Parallel
Algorithm Design for Branch and Bound" (PDF). In
Greenberg, H. J. Tutorials on Emerging Methodologies and Applications
in Operations Research. Kluwer Academic Press.
- Distributed
Optimization
- A.
Nedic and A.
Ozdaglar. Distributed
subgradient methods for multi-agent optimization. IEEE
Transactions on Automatic Control , 54(1):48–61, Jan. 2009.
- J.
C. Duchi, A. Agarwal, and M. J. Wainwright. "Dual averaging for
distributed optimization: Convergence analysis and network scaling," IEEE Transactions on Automatic Control, 57(3):592–606, Mar.
2012
- E.
Wei and A.
Ozdaglar. Distributed
alternating direction method of
multipliers. In 51st IEEE
Annual Conference on Decision and Control, pages 5445–5450, Dec. 2012.
- J.
F. C. Mota, J. M. F. Xavier, P. M. Q. Aguiar, and M. Puschel. D-ADMM: A
communication-efficient distributed algorithm for separable optimization. IEEE Transactions on Signal Processing,
61(10):2718–2723, May 2013.
- W.
Shi, Q. Ling, K Yuan,
G Wu, and W
Yin. On the linear convergence
of the admm in decentralized consensus optimization. IEEE Transactions on Signal Processing , 62(7):1750–1761,
April 2014.
- Convex
optimization in Control Theory
- Linear
Matrix Inequalities in System and Control Theory Stephen Boyd, Laurent
El Ghaoui, E. Feron, and V. Balakrishnan
- Linear
Controller Design – Limits of Performance, Stephen Boyd and Craig
Barratt
- Convex
optimization in Smart Grid
- Samadi,
Pedram, et al. "Optimal real-time pricing algorithm based on utility
maximization for smart grid." Smart Grid Communications
(SmartGridComm), 2010 First IEEE International Conference on. IEEE,
2010.
- Mohsenian-Rad,
Amir-Hamed, et al. "Autonomous demand-side management based on
game-theoretic energy consumption scheduling for the future smart
grid." IEEE transactions on Smart Grid 1.3 (2010):
320-331.
- Sortomme,
Eric, et al. "Coordinated charging of plug-in hybrid electric vehicles
to minimize distribution system losses." IEEE transactions on
smart grid 2.1 (2011): 198-205.
- Convex
optimization in robotics
- Verscheure,
Diederik, et al. "Time-optimal path tracking for robots: A convex
optimization approach." IEEE Transactions on Automatic Control
54.10 (2009): 2318-2327.
- Schulman,
John, et al. "Finding Locally Optimal, Collision-Free Trajectories with
Sequential Convex Optimization." Robotics: science and systems.
Vol. 9. No. 1. 2013.
- Zhu,
Minghui, and Sonia Martinez. "On distributed convex optimization under
inequality and equality constraints." IEEE Transactions on Automatic
Control 57.1 (2012): 151-164.
- Derenick,
Jason C., and John R. Spletzer. "Convex optimization strategies for
coordinating large-scale robot formations." IEEE Transactions on
Robotics 23.6 (2007): 1252-1259.
- Convex
optimization for motion planning
- Schulman,
John, et al. "Finding Locally Optimal, Collision-Free Trajectories with
Sequential Convex Optimization." Robotics: science and systems.
Vol. 9. No. 1. 2013.
- Prajna,
Stephen, Pablo A. Parrilo, and Anders Rantzer. "Nonlinear control
synthesis by convex optimization." IEEE Transactions on
Automatic Control 49.2 (2004): 310-314.
- Schulman,
John, et al. "Motion planning with sequential convex optimization and
convex collision checking." The International Journal of
Robotics Research 33.9 (2014): 1251-1270