coq-club AT inria.fr

Subject: The Coq mailing list

List archive

Re: [Coq-Club] Proof automation class ideas

From: Talia Ringer <tringer AT cs.washington.edu>
To: coq-club AT inria.fr
Subject: Re: [Coq-Club] Proof automation class ideas
Date: Sat, 21 Aug 2021 19:20:44 -0500
Authentication-results: mail3-smtp-sop.national.inria.fr; spf=None smtp.pra=tringer AT cs.washington.edu; spf=Pass smtp.mailfrom=tringer AT cs.washington.edu; spf=None smtp.helo=postmaster AT mx7.cs.washington.edu
Ironport-hdrordr: A9a23:BZqE16FPeSw2kB/UpLqEMMeALOsnbusQ8zAXPiFKOGdom6mj/fxG885rsSMc5AxhPU3I3OrwWpVoIkm9yXcW2/h2AV7KZmCP0wbFEGgh1/qA/9SKIVyGygcy79YZT0G8MrLNJGk/o8Lz4Az9Nc0hztmB+KXtoevF1X9iQUVLRshbnmBE48qgfHGejTMmOaYE
Ironport-phdr: A9a23:Cn6FBhWLR8KrT7o2OgcyGLXw33PV8KwGVTF92vMcY1JmTK2v8tzYMVDF4r011RmVB92duqsP1rWempujcFRI2YyGvnEGfc4EfD4+ouJSoTYdBtWYA1bwNv/gYn9yNs1DUFh44yPzahANS47xaFLIv3K98yMZFAnhOgppPOT1HZPZg9iq2+yo9JDffRlEiCC5bL9vIxm7rQfcvdQKjIV/Lao81gHHqWZSdeRMwmNoK1OTnxLi6cq14ZVu7Sdete8/+sBZSan1cLg2QrJeDDQ9LmA6/9brugXZTQuO/XQTTGMbmQdVDgff7RH6WpDxsjbmtud4xSKXM9H6QawyVD+/6apgVR3mhzodNzMh/27XhM5/gqJVrhyiuhJx3ZLbbZqPO/ZiZK7QZ88WSXZDU8tXSidPApm8b4wKD+cZI+hXtY39p1oUohCjGQesBeXvyjBWiX/swKY31PghERvH3AwmENIBrm7Uoc7pO6cJS+y0wrPHzS7Db/NX3zf955TIchcnof2WQ71/bNfRxFApGgjYgVqetZbrMCmJ1uQRrWeb9exgWPqthmMntw18oSSiy8kjh4TGgo8Y1FTJ+Dh2zos7J9C1VFN3bNy5HJdMtCyXKpZ6T8wsTm12vCs3yLMLtJ21cSYFzpks2h3Ra+SffoSV7R/vSPydLSp3iX57Yr6yiQy+/VWvx+D4TsW4zkpGoy5fntTPtn0BzQHf58mGR/dn/kqtxzCC3B3J5O5eO0A7j6/bJoYhwrEukpoTtlzOHir5mUj0lqOZal4k9vKm6uv9ebXmpp6cN4l7igHiNaQunNazDvolPQgTR2Sb+OK826P//UDhXblHgeE6nrPEvJ3VO8gXvKC0DxVI3oss7xuzFzKm384ZnXkDIlJFYhWHj43xNlHSOvD4DPG/g1uynzdx3P3JIqbhDo/DL3fZi7fhfbd960hTyAovytBf4YhYBa8cL/LuQkPxrsDXDgclMwyoxObqEMly1oQHWW6WHqCZNL7SvkST6+I0I+iMYZcVtyznJ/gk4f7ul345lkUHcamnx5tEIEy/S/9hOgCSZWfmqtYHC2YD+AQkH8Lwj1jXbTdXZn/6ZaM66TwhQNa6F4bFSY23qLeamjiyBZ1XYG9aDVbKHHv1IdbXE8wQYT6fd5cy2gcPUqKsHtdJPfSGvxS81LN8LuvS9TEfs9Tu2MUnv4U7dDk35WMyBN/bzGiWT2BykX8PQXk70L0t+SSVJX+Iyu5niudYFNpc+/RPFAo2KMyFp9E=

I remember watching you live prove during DeepSpec 2017. It's quite possible that the example was manufactured since it was educational, but one thing that stood out was that in trying to extend program-specific custom tactics to handle changes, you first made changes within individual proofs, then figured out how to bubble those changes out into your tactic going forward.

LLMs I think could be very helpful for figuring out those changes within individual proofs, but as of now, it is likely that a human would still need to observe that such repetition is occurring, then figure out how to generalize it inside of the custom tactic implementation. Though, I'm not certain of that; I think it would be a really interesting exercise for you to find someone with Copilot access, pick a project you're working on, and just hack with Copilot on for a long time, and see what it's good at and what it struggles with for your own style (with an extremely open mind, remembering that it ought to get better over time).

I'd be interested in a more systematic exploration of this for different proving styles if you're interested; I think it’s important to know as we head toward a future where these language models for programming and proving become much more widespread.

Talia

On Sat, Aug 21, 2021, 7:06 PM Adam Chlipala <adamc AT csail.mit.edu> wrote:

On 8/21/21 7:18 PM, Talia Ringer wrote:
> Language models do really well with enough context, and this is often
> neglected when people try them out, so they often abandon them early.
> I recently tried Copilot with Joe Cutler in the middle of CompCert to
> see what it does, and I think even a naive large language model like
> Codex with no semantic understanding at all does really well at
> synthesizing proofs inside of a large project specifically when you
> are in a repetitive file with many similar, small lemmas and slight
> variations between them. The syntax often captures many of those
> variations, and the LLM picks up on them easily. We will post a
> YouTube video on this Monday.
That sounds really interesting, though I'm wondering about the
comparison point of an Ltac script written by someone very familiar with
the code base, which would be used for all lemmas. That's kind of a way
of making a "style" first-class.

[Coq-Club] Proof automation class ideas, Talia Ringer, 08/19/2021
- Re: [Coq-Club] Proof automation class ideas, Michael Soegtrop, 08/19/2021
- Re: [Coq-Club] Proof automation class ideas, Adam Chlipala, 08/21/2021
  - Re: [Coq-Club] Proof automation class ideas, Lasse Blaauwbroek, 08/21/2021
    - Re: [Coq-Club] Proof automation class ideas, Adam Chlipala, 08/22/2021
      - Re: [Coq-Club] Proof automation class ideas, Jim Fehrle, 08/22/2021
      - Re: [Coq-Club] Proof automation class ideas, Lasse Blaauwbroek, 08/22/2021
  - Re: [Coq-Club] Proof automation class ideas, Talia Ringer, 08/22/2021
    - Re: [Coq-Club] Proof automation class ideas, Adam Chlipala, 08/22/2021
      - Re: [Coq-Club] Proof automation class ideas, Talia Ringer, 08/22/2021
        
        Re: [Coq-Club] Proof automation class ideas, Adam Chlipala, 08/22/2021
    - Re: [Coq-Club] Proof automation class ideas, Stefan Monnier, 08/24/2021
      - Re: [Coq-Club] Proof automation class ideas, Josef Urban, 08/24/2021
        
        Re: [Coq-Club] Proof automation class ideas, Talia Ringer, 08/25/2021
- Re: [Coq-Club] Proof automation class ideas, Jules Jacobs, 08/22/2021
  - Re: [Coq-Club] Proof automation class ideas, Jason Gross, 08/22/2021