

Programming.dev is hosting Iceshrimp: https://bytes.programming.dev
You could host your own instance, or if your opinion-pieces are programming related, post them there
Programming.dev is hosting Iceshrimp: https://bytes.programming.dev
You could host your own instance, or if your opinion-pieces are programming related, post them there
It’s called embeddings in other models as well:
https://huggingface.co/blog/getting-started-with-embeddings
https://ollama.com/blog/embedding-models
Also some feedback, a bit more technical, since I was trying to see how it works, more of a suggestion I suppose
It looks like you’re looping through the documents and asking it for known tags, right? ({str(db.current_library.tags)}.
)
I don’t know if I would do this through a chat completion and a chat response, there are special functions for keyword-like searching, like embeddings. It’s a lot faster, and also probably way cheaper, since you’re paying barely anything for embeddings compared to chat tokens
So the common way to do something like this in AI would be to use Vectors and embeddings: https://platform.openai.com/docs/guides/embeddings
So - you’d ask for an embedding (A vector) for all your tags first. Then you ask for embeddings of your document.
Then you can do a Nearest Neighbor Search for the tags, and see how closely they match
Recruitment is now basically Dead Internet theory…
It gives an example:
For example, with the phrase “My favorite tropical fruits are __.” The LLM might start completing the sentence with the tokens “mango,” “lychee,” “papaya,” or “durian,” and each token is given a probability score. When there’s a range of different tokens to choose from, SynthID can adjust the probability score of each predicted token, in cases where it won’t compromise the quality, accuracy and creativity of the output.
So I suppose with a larger text, if all lists of things are “LLM Sorted”, it’s an indicator.
That’s probably not the only thing, if it can detect a bunch of these indicators, there’s a higher likelihood it’s LLM text
Because Wordpress is also hosting 1000s of plugins that WP engine users can install.
I’m not sure what the license regarding those things is, WP engine could probably just mirror it -
But they basically got locked out of the default ecosystem infrastructure.
Since others already suggested mostly on-topic suggests, here’s an alternative suggestion:
Instead of looking specifically for a mentor - look for an open source project that you can help with. Ideally one with a discord or something to it’s easy to be in contact the the lead dev. A lot people don’t mind mentoring juniors, but in my experience it doesn’t happens that explicitly - “be my mentor” - and it might sound like you’re asking them a lot.
If you invert it into “Hey I wanna help you with your open-source project, but I don’t really know what to do, what your expectations are, how to implement a specific feature” - then you’re offering to do work them, instead of asking for something. And implicitly you’ll get mentorship in return.
And “real” projects probably also look better on your github / portfolio than only some dummy projects for learning purposes
Yea, I agree.
Also what’s the point now? At least a couple years ago we got a pretty cool t-shirt. Now we’re just getting a digital badge…?
40% of you are getting paid for this…? 🫠
That doesn’t really work all the time, because large files or large commits are lazy loaded on scroll, so what you’re searching might not have loaded yet
The code search does a server side search
Omg it’s sooo daammmn slooow it takes around 30 seconds to bulk - insert 15000 rows
Do you have any measurements on how long it takes when you just ‘do it raw’? Like trying to do the same insert though SQL Server Management Studio or something?
Because to me it’s not really clear what’s slow. Like you’re complaining specifically about the Microsoft ODBC driver - but do you base that on anything? Can you insert faster from Linux or through other means?
Like if it’s just ‘always slow’ it might just be the SQL Server. If you can better pinpoint when it’s slow, and when it’s fast(er) that probably helps to tell how to speed it up
When I stopped, subversion was what we used. I’m trying to understand Git, but it’s a giant conceptual leap.
It’s probably not ‘that much of a leap’ as you imagine. If you’re looking at Git tutorials, they’re usually covering all kinda complex scenarios of how to ‘properly use Git’. But a lot of people barely care about ‘properly using Git’ and they just kinda use it as a substitute for SVN… You create branches, you merge them back and forth, and that’s about it.
Like if you want to contribute to an open source project, all you have to do is create a fork (your own branch in SVN terms) - commit some stuff to it, and create a pull request (request to have your changes merged) back to the original branch. git pull
is just svn update
- getting someone elses commits
Not saying there aren’t more complex features in git, or that learning git properly isn’t worth it, just saying, I don’t think you have to see it as a ‘giant conceptual leap’ that’s preventing you from jumping back into programming. Easiest approach just to get started would be probably to just download a GUI like Sourcetree or Fork, and you just kinda pretend you’re still using SVN - approach wise
Problem Details for HTTP APIs - I have to work and integrate with a lot of different APIs and different kinda implementations of error handling. Everyone seems to be inventing their own flavor of returning errors.
My life would be so much easier if everyone just used some ‘global unified’ way to returning errors, all in the same way
What are you building, it depends a bit on your usecase
Otherwise c# Blazor compiles to WASM
If it’s a public repo, revoke the key (on your own/company repo it might not matter so much)
Then
git reset head~1
git push - f
Right… well clearly I have clicked all the links, and read all the things, and I still don’t understand it. “Never attribute to malice that which is adequately explained by stupidity.”
So assume in good faith, assume I have absolutely no idea what the problem even is due of my own stupidity. So ELI5 and give a synopsis of the problem.
Did you stop reading or are you intentionally trying to phrase it as if the universities won’t do anything? […]
It doesn’t seem like you’re even trying to make a good faith argument.
My first sentence and first 4 words are “So what’s stopping them?”. So did you stop reading before that - or what are you even arguing about, and where is your ‘good faith’? You’re arguing about meta-nonsense without answering
What’s stopping them? What do they even need from “federation” or “ActivityPub” to just build this?
So what’s stopping them? Universities have internship programs and internal projects. In a university team of 4 people doing projects, 63x4 252 students could be assigned to a project to build this.
But
The french open science committee (CoSO) is indeed interested in the ActivityPub implementation in GitLab
Good phrasing. They are “interested in the ActivityPub implementation” not “interested in the implementing ActivityPub” - so who gives a shit what a bunch of universities are interested in
“Don’t attribute to malice what is easily explained by incompetence”
So yea Mozilla wrote some terms that where ambiguous and could be interpreted in different ways, and ‘many people believed’ that they did this intentionally and had the worst intentions possible by their interpretation of the new ToS
Then Mozilla rewrote that ToS after seeing how people were interpreting the original ToS:
https://www.theverge.com/news/622080/mozilla-revising-firefox-terms-of-use-data
And yea, now ‘many people will believe’ that ‘Mozilla revised their decision to do this after the backslash’ - OR, it was never their intention and now phrased it better after the confusion
People just want to get their pitchforks out and start drama at any possible opportunity without evidence of wrongdoing… Mozilla added stupid stuff to the ToS, ok yea fair enough - but if they actually did “steal user data” - this would be very easily detectable with Wireshark or something