Priors are then not merely subjective but useful; the OP is about the problem of choosing the best priors. The best options are informative priors (1) and regularizers (2). For example, choosing a Laplace distribution as the prior for the unknown parameters is equivalent to the LASSO, a well-known way of obtaining sparse models with few coefficients. In (2) there is an example in which a prior suggests a useful regularization method for regression. In (3) the author discusses prior modeling.
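The Laplace-prior/LASSO connection is easiest to see in one dimension: with a Gaussian likelihood, the MAP estimate under a Laplace prior is soft-thresholding of the observation. A minimal sketch in plain Python (function name and the unit-variance assumption are mine):

```python
import math

def soft_threshold(y, lam):
    """MAP estimate of theta for y ~ N(theta, 1) with a Laplace(0, 1/lam)
    prior: argmin over theta of 0.5*(theta - y)**2 + lam*abs(theta).
    The L1 penalty (the negative log of the Laplace prior) pushes small
    observations exactly to zero, which is why the LASSO yields sparse models.
    """
    return math.copysign(max(abs(y) - lam, 0.0), y)

# Small observations are zeroed out (sparsity); large ones are shrunk.
print(soft_threshold(0.3, 0.5))  # 0.0
print(soft_threshold(2.0, 0.5))  # 1.5
```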
The platform allows the agents to create new tools and organize knowledge. It's about 60,000 lines of code, roughly six months of work at 10 hours/day, seven days a week. I hope my friend gets the appropriate funding to continue with his project. As Karpathy recommends in one of his talks: first get the system to top capability, then reduce costs.
By solving a hard problem, or by providing a new way of attacking one, an agent can show behavior that mimics intelligence. What kind of problem should this system attack?
>> find the GCD (greatest common divisor) of the smallest and largest numbers in an array
Just for a short comparison, in J the analogous code is <./ +. >./
Where / is the reduce adverb, +. is GCD, and *. is LCM.
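For readers without J at hand, the same computation written out in Python (function name is mine):

```python
from math import gcd

def gcd_min_max(xs):
    """GCD of the smallest and largest numbers in the array,
    the same computation as the J one-liner above."""
    return gcd(min(xs), max(xs))

print(gcd_min_max([12, 8, 30, 18]))  # gcd(8, 30) = 2
```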
The basic idea of J notation is using a small change in a symbol to mean the opposite: for example {. for first and {: for last, {. for take and }. for drop (one symbol can be used as a unary or binary operator with different meanings). So if floor is <. you can guess what the symbol for ceiling will be. As another example, /:~ sorts in ascending order, and I imagine you can guess the symbol for sorting in descending order. In a sense, J notation includes some semantic meaning, and an LLM could use that notation to try to change an algorithm. So perhaps someone could think about how to expand this idea for LLMs to generate new algorithms.
The matrix m, the sums over its rows (the column totals), and the maximum of those sums, in J (separated by ;):
m ; (+/ m) ; >./ +/ m
┌─────┬───────┬──┐
│0 1 2│9 12 15│15│
│3 4 5│ │ │
│6 7 8│ │ │
└─────┴───────┴──┘
To understand this you need to know that >. and <. are the max and min functions, and that in J three functions separated by spaces, f g h, constitute a new function (a "fork") mathematically defined by (f g h)(x) = g(f(x), h(x)). An example is (+/ % #), which applied to a list gives the mean of the list: +/ gives the total, # gives the number of elements, and % is division.
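The fork can be sketched as a higher-order function; a Python version of (+/ % #) under that reading (names are mine):

```python
def fork(f, g, h):
    """J's monadic fork: (f g h)(x) = g(f(x), h(x))."""
    return lambda x: g(f(x), h(x))

# (+/ % #): total divided by count, i.e. the mean.
mean = fork(sum, lambda a, b: a / b, len)
print(mean([1, 2, 3, 4]))  # 2.5
```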
I didn't know about his inequalities, but I found (1), which provides an example of applying Talagrand's inequality to the longest increasing subsequence problem (12 pages, easy to read). It seems to be a broad generalization of the Hamming distance.
I think your comment is very interesting. I have reflected many times on how to differentiate things that appear in the same context from things that are similar. Any big idea here could be the spark that initiates a great startup.
In Spanish we have ensayo = attempt or try; it is frequently used to refer to rehearsals before the real show or performance (at a music festival, in a theater, ...). Ensayo is also exactly equivalent to the "essay" that Montaigne introduced when he used the word for a written composition.
(1) https://en.wikipedia.org/wiki/Prior_probability#Informative_...
(2) https://skeptric.com/prior-regularise/index.html
(3) https://betanalpha.github.io/assets/case_studies/prior_model...