pl.aiwright - GPT-4 dialogue for Disco Elysium: The Final Cut

Research into efficient optimization techniques seems pretty important given the scale of LLMs these days. Nice to see a second-order approach that achieves reasonable wall-clock improvements.

nsa@kbin.social · 1 year ago

Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training

nsa@kbin.social · 1 year ago

If there isn’t any discussion on reddit (as in this case), I don’t see a reason to link to reddit; you can just link to the project page. That said, if you think there is important discussion happening that helps with understanding the paper, then use a teddit link instead, like:

https://teddit.net/r/MachineLearning/comments/14pq5mq/r_hardwiring_vit_patch_selectivity_into_cnns/

nsa@kbin.social · 1 year ago

Please don’t post links to reddit.

nsa@kbin.social · 1 year ago

It seems like for creative text generation tasks, metrics have been shown to be deficient; this even holds for the new model-based metrics. That leaves human evaluation (both intrinsic and extrinsic) as the gold standard for those types of tasks. I wonder if the results from this paper (and other future papers that look at automatic CV metrics) will lead reviewers to demand more human evaluation in CV tasks like they do for certain NLP tasks.

nsa@kbin.social · 1 year ago

Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models

nsa@kbin.social · 1 year ago

hmmm… not sure which model you’re referring to. do you have a paper link?

nsa@kbin.social · 1 year ago

do you have a link?

nsa@kbin.social · 1 year ago

@Koffindodjer indeed you are!

nsa@kbin.social · 1 year ago

Extending Context Window of Large Language Models via Positional Interpolation

nsa@kbin.social · 1 year ago

Inverse Scaling: When Bigger Isn't Better

nsa@kbin.social · 1 year ago

Craft an Iron Sword: Dynamically Generating Interactive Game Characters by Prompting Large Language Models Tuned on Code

nsa@kbin.social · 1 year ago

r/MachineLearning finally received a warning from u/ModCodeOfConduct

nsa@kbin.social · 1 year ago

If the effect is strong enough, then it could be very harmful to LLM training in the near future, considering more and more of the internet contains ChatGPT & GPT-4 content and automatic detectors are currently quite poor.

nsa