For the best experience on desktop, install the Chrome extension to track your reading on news.ycombinator.com
Hacker Newsnew | past | comments | ask | show | jobs | submit | history | olasaustralia's commentsregister

The title doesn't match what's in the article. A better title would be 'Wikipedia's editors are mostly men and they don't have an understanding of non-heterosexual ideas or people'.

It seems like editors have arguments on these topics, according to the article. Not harassment on every topic under the sun


the issue seems to be, agenda driven groups are finding their attempts to change pages thwarted by content managers who are more concerned about accuracy. this is far more common on the English speaking version of Wikipedia than elsewhere.

and typical of the situation, when the agenda is thwarted regardless of reason fall back on the media to shame the group standing in the way and be sure to portray it in an offensive manner not withstanding the real issue


How do you go from "having arguments over topic x" to "having a poor understanding of topic x"?


How do you go from "posting pictures of genitalia on user pages" to "having arguments over topic x"?


Not rereading the article again, but I am pretty sure posting pictures of genitalia on user pages was not the only thing going on.

As a rule, just because an asshole or idiot disagrees with you, it doesn't make you right. Stop trying to use that as an argument.


> Not rereading the article again, but I am pretty sure posting pictures of genitalia on user pages was not the only thing going on.

On the other hand I'm pretty sure that "having arguments over topic x" wasn not the only thing going on either.

> As a rule, just because an asshole or idiot disagrees with you, it doesn't make you right. Stop trying to use that as an argument.

Where is that used as an argument?


A title like that would be flagged and do nothing but to further the flame of a gender war.


That's not how it works. (1-1%)^100=37%



> although Google Translate is extremely useful (and I use it all the time), it is true that it does not usually match the skills of the best human translators

I think it is safe to say that Google Translate almost never macthes the skills of human translators except for rare trivial cases. As a rule it's rubbish, and sometimes as an exception it can be OK. Nobody argues against its usefulness, especially considering it's free and that it's pretty much the only tool we all use to understand things written in other languages. Better than nothing for sure.

Further, the whole analogy with flying and birds completely misses the point. Airplanes do satisfactory job at transporting humans and cargo, they are not required to fly like birds or mate with birds. We have invented what seems to be a more efficient method of flying using engines rather than flapping. All in all, airplanes are not birds and were never meant to be.

Similarly, in my view the arguments regarding machine translation come down to two things today: firstly whether it's useful or not, and secondly wether ML improves translation or not. My answers are: (1) weeeell, yeah, it's kinda useful in a better-than-nothing kind of a way and (2) doesn't seem like a significant improvement to me, if at all.


> I think it is safe to say that Google Translate almost never macthes the skills of human translators except for rare trivial cases. As a rule it's rubbish, and sometimes as an exception it can be OK.

Depends on the language pair. Depends on the human translator. I met translators who wouldn't compete against 1990s Babelfish / SYSTRAN well. They still had some business.


I think it is unfair and even wrong to compare GT to the job of professional real time translation, because the latter is probably one of the most challanging jobs that I know of, next to astronauts and film stuntmen. Instead, what we mean most of the time is "this piece of text in the language I speak fluently that came out of Google Trabslate reads like nonsense to me, but I kinda get it (though sometimes I don't)". Meaning, it's way below my standards of quality for a language I know very well.


Who said anything about real time?


Google translate has lots of disadvantages but it is at least fast. Professional written translators manage a few tens of words a minute; professional simultaneous translators manage to keep up most of the time but are occasionally defeated by languages with very different sentence structures, and require frequent breaks.


This amazingly misses the point. Shallit's arguments with birds are the kind of arguments a student has before taking an AI class and feels rather smart about it.

Even though airplanes are different from birds, airplanes still need to follow the laws of physics and aerodynamics. Birds too need follow the laws of physics.

With intelligence, no one knows what the laws are. That is Hofstader's point.


I think he’s (deliberately? Give him the benefit of the doubt) misinterpreted Hofstadter’s use of the words “processing text”.

What he means, which seemed quite obvious to me, is that the machine is not reading text, building up a semantic interpretation of the sentence in the way a human does. That’s because a neural net is a simple pattern recognition machine that does not work in the same way as the brain. It doesn’t have the immense life experience to draw upon (probably relatively easy to fix) but more importantly it doesn’t have a concept of what a sentence actually means.

A neural network doesn’t have an understanding of what “double” means. It just pattern matches a translation.

I think there’s some serious symbolic reasoning going on in the brain, which neural nets don’t yet perform. It all feels like shallow syntactic matching right now, rather than semantic reasoning.

Unfortunately I don’t know how the brain works, so you end up in an absurd argument where people say “the brain is just a neural net” and it’s impossible to completely refute their claims, just as if someone said “the brain is a very large lookup table”. Well, from what I see it does seem that the brain is much smarter than that, but I can’t be certain without knowing what that extra kicker is. So whilst such an assertion seems terribly simplistic and self-evidently insufficient, it is difficult to argue with.


“The brain is just a neural net” plus real life experience, in fact years and years of experience. Pretty much every example that automated translation gets wrong is when a real life context is required. Our language is not just a sequence of words and sentences, it almost always implies some contextual klnowledge. Where two humans have siginficantly different backgrounds they may have difficulty understanding each other for the same reason. A total lack of real life experience on one of the sides makes it even worse: it produces barely comprehensible near-nonsense.


One problem with any attempt to map human intelligence to different types of artificial intelligence is that we only have a very precious few such types of AI, so there may be any number of things we're missing.

It's a case of not having the right analogies. We liken various bodily systems to machines: the heart is a pump, the lungs are funnels, the kidneys are filters, etc. These work up to some point because we understand both sides of the analogy well enough- we understand how the heart works, to the extent that it works like a pump and we understand how pumps work, etc.

But with intelligence we don't have this luxury. We don't understand how intelligence works, yet we draw an analogy not just with computers ("the brain is a computational device") but with specific types of computer programs. However, there are, literally, an infinite number of different computer programs and an unknown number of them could produce results similar, or even identical, to our intelligence.

Of course understanding intelligence is basically coming up with a good model of it. But that must be preceded by a good understanding of how intelligence works, which we currently don't have. Instead, what some researchers do, is that they take their arbitrarily chosen favourite AI model and try to find a way to argue that it's "like" human intelligence.

Neural networks are particularly guilty of this sort of thing. The whole idea of connectionism is to mimic the way the brain does intelligence, however we don't know what that is, so we've just come up with a complex machine that can optimise systems of functions, instead (I mean the set of neural network architectures). Then, when this machine turned out to be good at doing what it was designed for, optimising systems of functions, we claimed this as proof that it's actually doing what the brain does. That's a very circular way of thinking.


> When Hofstadter says "There's no fundamental reason that machines might not someday succeed smashingly in translating jokes, puns, screenplays, novels, poems, and, of course, essays like this one. But all that will come about only when machines are as filled with ideas, emotions, and experiences as human beings are", that is just an assertion. I can translate passages about war even though I've never been in a war. I can translate a novel written by a woman even though I'm not a woman.

When refuting his claims, he also makes some errors. Hofstadter has some very good reasons to justify what he says, whereas Shallit's argument is more or less "we can do slightly better translations now than before, so there is no reason they couldn't be better" - whereas the whole point of Hofstadter is that it's impossible to do exactly using current methods.

I understand both views and I think Shallit might have a point, but he doesn't justify it well. Simply by using statistical methods you can achieve surprisingly good results, although the error ratio is still too high IMO. What we already can do is to produce relatively good translations of texts belonging to well defined domains, such as legal documents. But in order to do it well with all domains, we'd need to find a good method of passing on the necessary context information to the translation engine.

Imagine translating subtitles of a movie. It's absurd to expect the machine will produce better results than a human as it's lacking visual cues. However, if we manage to transmit this information to the translation machine (via Computer Vision, audio profiling etc.), it can get much better results. It's very difficult to expect good results could be obtained just by training the neural networks based on previous movies. Yet, this is what Shallit seems to argue.


I was expecting this reply to be like "yes translation is not yet perfect but we have some ideas how to fix this, and the solution is different than what Hofstadter thinks". Instead it was more like "AlphaGo is different from humans, airplanes are different from birds, and deep translation is different(better?) from human translation."


And self-driving cars may not get drunk but they'll kill you in different ways than humans do (think software errors, or hacking).

He may actually have a point there.


I think Shallit is largely wrong. The Hof didn’t specifically cite syntax vs semantics but I think that’s what he’s saying and that’s what GT is doing.


The gist of his post seems to be:

>> If mediocre translations can be done now without the requirements Hofstadter imposes, there is just no good reason to expect that excellent translations can't be eventually be achieved without them, at least in the same degree that Hofstadter claims.

Of course, there's no good reason to expect that excellent translations _can_ eventually be achieved in this manner, either. We just have to wait and see, according to Jeffrey Shallit.

On the other hand, we can already see that GT is not just "mediocre"; it fails in specific ways that suggest fundamental weaknesses of the way it does translation. By analogy, it's like having a flying machine that can only fly as long as it's anchored to the ground. There's no good reason to expect that such a machine will ever be able to do anything else than fly around the same spot.

So Hofstatder's article is not a discussion of requirements for good translation, only. It's also a discussion of limitations of Google Translate, that have to be overcome before it can consistently offer good translations.


P


Ah well - HN needs a link submission tool similar to Reddit if there is a duplicate link


The website needs a name and email address to proceed but there is no authentication - so a temporary email address will do. The article is long to read at around 20 mins but worth it


Studio Ghibli also seems like a good bet


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search:

HN For You