Tuesday, April 25, 2017

Algorithm and Activism

There is a lot more to algorithms and activism, but for now, take a listen to this page of Adela Wagner and her simple but powerful work.

Friday, April 14, 2017

Data Science and Game Theory Workshop


Third Workshop on Algorithmic Game Theory and Data Science, to be held June 26, 2017 in Cambridge, Massachusetts.

From the call:

Computer systems have become the primary mediator of social and economic interactions, enabling transactions at ever-increasing scale. Mechanism design when done on a large scale needs to be a data-driven enterprise. It seeks to optimize some objective with respect to a huge underlying population that the mechanism designer does not have direct access to. Instead, the mechanism designer typically will have access to sampled behavior from that population (e.g. bid histories, or purchase decisions). This means that, on the one hand, mechanism designers will need to bring to bear data-driven methodology from statistical learning theory, econometrics, and revealed preference theory. On the other hand, strategic settings pose new challenges in data science, and approaches for learning and inference need to be adapted to account for strategization. The goal of this workshop is to frame the agenda for research at the interface of algorithms, game theory, and data science.


Wednesday, April 12, 2017

Algorithms in the Field: The PI Meeting

Algorithms in the Field is a program of Algorithms + Other CS researchers working jointly "in the field" of problems. Here is a history of development of this program. Recently I attended the PI meeting for this NSF program:
  • Nearly all the talks were interesting, genuinely targeting the middle ground. They didnt go into the rabbit holes of describing the minute technical improvements in approximation ratios or the 6 different data sets they had to cull to compare 4 different algorithms. Each speaker made an effort to cover the field area (special hardware like memristors, SDNs, wireless receivers etc) and also provide an overview of the algorithmic challenges. This was refreshing. 
  • There were some very interesting examples of the joint research: Ashish Goel spoke about societal decision making and examples such as budget decisions, viewed as collaborative convex programs; Piotr Indyk, ever-wise with his choices, spoke about the wireless transmitter/receiver setting for most part of his talk, really communicating the nature of the "field" before quickly connecting it to sparse Fourier estimation problems; Vyas Sekar spoke about formulating SDN routing policies using path constraints and posed some open problems, which Bobby Kleinberg who went just after, proceeded to solve at least partially, with his work. This shows (a) Algorithms in the Field research can happen in real time and (b) Dont speak before Bobby and pose open problems. 
ps: Sucheta Soundarajan and Martin Farach-Colton organized the meeting and did an excellent call to include graduate students/postdocs in the invite, graduate students/postdocs being the glue between professors and conduits for communication. 


Documenta 12--14

I think there is a quote that, "I have never seen a good Rothko, but I have seen a great roomful of Rothkos". I dont know who said it, but I am going to attribute it to Assaf Naor who reminded me of this phenomenon a while ago. Anyway, DC has a roomful (or two) of Rothkos.

Likewise, Documenta is an art show (Not Art Basel, but Art Kassel!), it may be good by itself, but great seen in mental juxtaposition. Here are the New York Times reviews of  Documenta 12 from 2007, 13 from 2012 and 14 from 2017 which has just started. 

Monday, March 20, 2017

Self II

Here is how the world sees me.
  • There is a coffee place in SFO, tucked away where people dont make it often. I pick up coffee there when I fly in from EWR, and they give me a discount because "you are a taxi driver". 
  • I went to a WeWork location in NY city to meet a friend, and as I walked in, the receptionist said, "talk to that person, she knows who needs the handyman in the building", and I got to go through the process of a fixer-upper to enter the building. 

Self I

People seem to like it when I poke at myself:

In a recent conversation, we discussed Dad Jeans, described precisely here, but more as a state of mind. A parent needs to think several steps ahead on behalf of their kid who can swerve from disegaged to insightful, be prepared for spills, and be prepared to be out the door the instant the kids are ready unexpectedly for the playground in the winter. So, Dad Jeans, is the choice of wear, it communicates that you are unable to be anything or be anywhere else, beyond your control.

Being me, I have to find my own way to express that state of mind, so these days I am doing Dad Hair, baggy, ready to follow me instantly, and unable to be anything else. :)

Monday, March 06, 2017

CS Divisions

Thanks to a recommendation from Marc Donner from old google days who now runs Uber, NYC, I am reading Sapiens by Yuval Harari. The author tries to explain the history of humans, succinctly, and succeeds by having an insightful view of anthropology, sociology, behavioral theory, and of course, science and religion too.  One of the interesting parts for me was the need humans felt to divide people into categories (think commoner/noble, castes, etc).  Alas, with division into categories, comes an imposed order among them and fights to invert the order. The author argues that this imagined order among humans keeps societies stable when it works, and unstable when it doesnt.

I have always been suspicious of divisions. In CS, folks divide areas of research. These are not islands.  In any area of research (say AI, social networks, Robotics, Brain, whatever), there are (a) theoretical foundations and optimizations, (b) new systems research into hardware and software needed to program them, compile into executables, execute them efficiently, (c) new data and UI systems to use, analyze, report, mine and troubleshoot, and so on. A great research will include conceptual breakthroughs, cacophony of math symbols no more than what is needed, potential for pretty plots, and a storyline for NY Times for societal impact. Most individuals' research doesnt hit on all these metrics, doesnt have to, we rely on the cumulation of research to hit all of the metrics. Any research area will be potentially less engaging without ALL of these elements, no order amongst them is needed. 

Extreme Streaming

I am making my way back into researching streaming problems.

One of the directions I am focusing on: how to use not polylog memory as is standard in streaming algorithms, but even smaller, say O(1) memory.  My coauthors and I have such algorithms for estimating the H-index on streams (to appear in PODS 2017, will be on arxiv soon) and estimating heavy hitters in a stream of streams model (to appear in SDN 2017).

I was sort of pushed into this model the way I like to find problems in general. If you look at modern applications, there are some real constraints. For examples in SDNs (Software Defined Networks), there are memory pipelines that packets can percolate through, each memory stage can be thought of as a row of standard sketches, and then one needs to compute something on top of these row estimates, but use only memory that can fit into a single packet header. Another example is that streaming analyses are done for a very large number of groups (say for each source IP address or internet user) and in that case, polylog memory per group is already far too much.

I call these extreme streaming problems, inspired by Extreme Classification in Machine Learning, which studies ML problems with a very large number of labels. I think there is more to mill here.


Sunday, February 26, 2017


Bit longer than a tweet:

  • I was late for my meeting because I was stopped at a traffic light at some NYC corner, a couple asked, "Hey, you from around here? Can you recommend a restaurant?", and I responded. Within a 3 block radius of  any street intersection in Manhattan, there are enough good restaurants to keep one talking for a while. 
  • I entered some word into iPhone during email and it autocorrected to "art". I must have done or at least talked about art sometime in the past. :)
  • I was at a pst and the corporation that trying to convince someone to be a customer of their "critical" service said, "We want to be the ONE throat you choke, if you have a problem." 

Sunday, February 19, 2017

Exciting new book

My long term collaborator, thinker, and a theory researcher Ramesh Hariharan has put together a book that sounds fascinating: Genomic Quirks: The Search for Spelling Errors.

"This is a book of real stories about the search for genomic spelling errors that have stark consequences -- infants who pass away mysteriously, siblings with misplaced organs, a family with several instances of vision loss, sisters whose hearts fail in the prime of their youth, a boy whose blood can’t carry enough oxygen, a baby with cancer in the eye, a middle-aged patient battling cancer, and the author’s own color blindness. The search in each case proves to be a detective quest that connects the world of medical practice with that of molecular biology, traversing the world of computer algorithms along the way."