If I regurgitate something I read in a copyrighted book without a proper license, that would also be theft; no distinction there.
I'm not distributing my brain. At the least, the same rules (but probably more restrictive ones) should apply to models: training is okay, but using and distributing should be limited by copyright.
It would not be illegal under fair use (though you have to be careful there too), but if you try to regurgitate large portions of the book, then it would be. And we do know that models regurgitate training material verbatim (see Copilot).
Wait until you hear about frankenmodels. You rip parts of one model (often attention heads) out and transplant them into another, and somehow that produces coherent results! Witchcraft
Sure, but it's still bonkers that it works so well
>Also is there a practical motivation for creating them?
You can get in-between model sizes (like 20B instead of 13B or 34B). Before better quantization existed, this was useful for inference (if you were unlucky with your VRAM size), but now I see it being useful only for training, because you can't train on quants.
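The "passthrough" style of frankenmerge (stacking layer ranges from two parent models to get an in-between or larger depth) can be sketched with toy residual blocks. This is purely illustrative: the dimensions are made up and numpy matrices stand in for real transformer layers.

```python
import numpy as np

rng = np.random.default_rng(0)
D = 8  # toy hidden size

def make_model(n_layers):
    # each "layer" is a residual nonlinear map standing in for a transformer block
    return [rng.normal(scale=0.1, size=(D, D)) for _ in range(n_layers)]

def forward(layers, x):
    for W in layers:
        x = x + np.tanh(x @ W)  # residual connection keeps stacking stable
    return x

model_a = make_model(6)
model_b = make_model(6)

# passthrough "frankenmerge": first 4 layers of A stacked on the last 4 of B
# yields an 8-layer model from two 6-layer parents, i.e. an in-between depth
franken = model_a[:4] + model_b[2:]

x = rng.normal(size=(1, D))
y = forward(franken, x)
print(y.shape)  # (1, 8)
```

Real merges splice actual weight tensors layer-by-layer, but the shape of the trick is the same: because residual blocks map a hidden state back into the same space, layers from compatible models can be concatenated and still produce coherent outputs.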
Personally, I have found that Mistral 7B (with its native 8K context, and decent results stretched out even more) is performing much better than llama 13B tunes for storytelling, where that long context is really important.
And I think the optimized backends should implement that sliding 16k context soon...
Anyway, point is a huge context really helps certain types of queries, and VRAM usage is reasonable with a 7B model.
You could go even lower with smaller quantization if necessary. Personally I wouldn't use anything smaller than 7B, and Mistral is already pushing it in coherence. Overall it depends on your use case; not everyone needs smart models, or a large context that sometimes takes half of the required memory.
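That "context takes half of the memory" point is easy to sanity-check with back-of-the-envelope math. A rough sketch, using publicly known Mistral-7B-like numbers (32 layers, GQA with 8 KV heads, head dim 128) as assumptions; real usage varies by backend and overhead:

```python
def model_memory_gb(n_params, bits_per_weight):
    # weight storage only, ignoring activations and runtime overhead
    return n_params * bits_per_weight / 8 / 1e9

def kv_cache_gb(n_layers, n_kv_heads, head_dim, seq_len, bytes_per_elem=2):
    # one K and one V vector per layer per position, fp16 elements by default
    return 2 * n_layers * n_kv_heads * head_dim * seq_len * bytes_per_elem / 1e9

weights = model_memory_gb(7.2e9, 4)        # ~7B model at 4-bit quant
cache   = kv_cache_gb(32, 8, 128, 16_384)  # 16k context, fp16 KV cache
print(f"weights ~{weights:.1f} GB, kv cache ~{cache:.1f} GB")
```

With these assumed numbers the fp16 KV cache at 16k context lands around 2 GB against ~3.6 GB of 4-bit weights, so the context really can approach half the footprint of a heavily quantized 7B model.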
Codellama is also surprisingly good even for non-coding tasks
"Any loan or lease of defense articles to the Government of Ukraine under paragraph (1) shall be subject to all applicable laws concerning the return of and reimbursement and repayment for defense articles loan or leased to foreign governments."
Can you provide a source for your statement that these were donations?
Most aid takes the form of cash for humanitarian support, ~disposing of~ donating old hardware to Ukraine, or a loan for the express intent of buying a modern system.
The vast majority of the "billions" given to Ukraine is made up of stuff that we are actively trying to throw away.
IIRC temperature is proportional to velocity squared, so an increase in simulation speed will result in a squared increase in temperature. But even if we account for that, we are not dealing with an ideal gas in the real world; there are bound to be collisions that break molecules apart, or even start nuclear fusion at certain speeds.
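The "temperature goes as velocity squared" part follows from equipartition, (3/2) k_B T = (1/2) m ⟨v²⟩. A minimal sketch, assuming a monatomic helium gas with made-up sampled speeds:

```python
import numpy as np

k_B = 1.380649e-23  # Boltzmann constant, J/K
m = 6.6335e-27      # mass of a helium atom, kg (assumed gas)

rng = np.random.default_rng(1)
v = rng.normal(scale=1100.0, size=(100_000, 3))  # velocity components, m/s

def temperature(v):
    # equipartition for a monatomic gas: (3/2) k_B T = (1/2) m <v^2>
    return m * np.mean(np.sum(v**2, axis=1)) / (3 * k_B)

t1 = temperature(v)
t2 = temperature(2 * v)  # "speed up the simulation" by doubling every speed
print(t2 / t1)  # ~4: doubling all speeds quadruples the temperature
```

So a 2x speed-up of every particle reads out as a 4x hotter gas, which is exactly the squared relationship the comment describes.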
Sure! It's just a classification of events that each person has.
Testable prediction? Socially close people will have more similar classifications than randomly chosen ones (I will leave defining a metric space over possible classifications as an exercise for the reader)