[HN Gopher] A Multimodal Automated Interpretability Agent
___________________________________________________________________
A Multimodal Automated Interpretability Agent
Author : el_duderino
Score : 54 points
Date : 2024-07-24 12:42 UTC (10 hours ago)
(HTM) web link (arxiv.org)
(TXT) w3m dump (arxiv.org)
| empath75 wrote:
| https://arxiv.org/pdf/2404.14394
|
| Actual paper to save you from having to read the PR release.
| dang wrote:
| Ok, we'll change the URL to that from
| https://news.mit.edu/2024/mit-researchers-advance-
| automated-.... Users may still want to read the latter for a
| quick intro.
| curious_cat_163 wrote:
| > We think MAIA augments, but does not replace, human over- sight
| of AI systems. MAIA still requires human supervision to catch
| mistakes such as confirmation bias and image generation/editing
| failures. Absence of evidence (from MAIA) is not evidence of
| absence: though MAIA's toolkit enables causal interventions on
| inputs in order to evaluate system behavior, MAIA's explanations
| do not provide formal verification of system performance.
|
| For folks who are more familiar with this branch of literature,
| given the above, why is this a fruitful line of inquiry? Isn't
| this akin to stacking turtles on top of each other?
___________________________________________________________________
(page generated 2024-07-24 23:04 UTC)