menu
Tatoeba
language
Register Log in
language English
menu
Tatoeba

chevron_right Register

chevron_right Log in

Browse

chevron_right Show random sentence

chevron_right Browse by language

chevron_right Browse by list

chevron_right Browse by tag

chevron_right Browse audio

Community

chevron_right Wall

chevron_right List of all members

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search

Wall (6,969 threads)

Tips

Before asking a question, make sure to read the FAQ.

We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.

Latest messages subdirectory_arrow_right

Wezel

9 hours ago

feedback

saverius

13 hours ago

subdirectory_arrow_right

sharptoothed

14 hours ago

subdirectory_arrow_right

small_snow

17 hours ago

feedback

sharptoothed

2 days ago

subdirectory_arrow_right

CK

3 days ago

subdirectory_arrow_right

Pfirsichbaeumchen

4 days ago

feedback

Aidys

5 days ago

feedback

CK

5 days ago

feedback

Thadh

13 days ago

jan_Ketalija jan_Ketalija February 6, 2024 February 6, 2024 at 6:47:35 AM UTC link Permalink

I've also been wondering. Why is there such a large Kabyle speaking community on Tatoeba? I haven't seen Kabyle anywhere else on the internet really, even on linguistic-based websites. What about Tatoeba has such a draw for Kabyle speakers? (The same goes for other Berber languages, but Kabyle is the one I see the most.)

{{vm.hiddenReplies[40491] ? 'expand_more' : 'expand_less'}} hide replies show replies
Thanuir Thanuir February 6, 2024 February 6, 2024 at 8:27:34 AM UTC link Permalink

Tatoeba on vapaaehtoisuuteen perustuva verkkosivu. Käytännössä, jos joku innostuu siitä ja värvää aktiivisesti muita, voi syntyä hyvä kierre jossa kyseinen kieli tai kulttuuri tuntuu yliedustetulta. Vastaavasti moni muu kieli voi olla raskaasti aliedustettuna, koska kukaan ei ole vain sattunut innostumaan verkkosivusta tai innostus on hiipunut.

Mutta tämä on aivan luonnollista, koska osallistujamäärät ovat pieniä. Tällöin satunnaisvaihtelu on suhteessa suurta.

Igider Igider February 6, 2024, edited February 6, 2024 February 6, 2024 at 9:19:41 AM UTC, edited February 6, 2024 at 9:19:58 AM UTC link Permalink

You've created the profil very recently.

:-)

Cabo Cabo February 8, 2024, edited February 8, 2024 February 8, 2024 at 6:26:13 PM UTC, edited February 8, 2024 at 6:26:21 PM UTC link Permalink

They found this place to conquer while ppl let it to happen.

{{vm.hiddenReplies[40496] ? 'expand_more' : 'expand_less'}} hide replies show replies
Thanuir Thanuir February 9, 2024 February 9, 2024 at 5:07:44 PM UTC link Permalink

Ei Tatoebaa voi vallata. Toisen kielen lauseet eivät vaikuta elämääsi mitenkään, jos et halua.

{{vm.hiddenReplies[40497] ? 'expand_more' : 'expand_less'}} hide replies show replies
February 9, 2024, edited February 9, 2024 February 9, 2024 at 6:54:32 PM UTC, edited February 9, 2024 at 6:58:58 PM UTC link Permalink
warning

The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.

{{vm.hiddenReplies[40499] ? 'expand_more' : 'expand_less'}} hide replies show replies
Cangarejo Cangarejo February 10, 2024 February 10, 2024 at 11:07:10 AM UTC link Permalink

On the Advanced Search page, there's an option that allows you to limit results to a single user. That allows you to only see sentences that belong to your friends.

{{vm.hiddenReplies[40500] ? 'expand_more' : 'expand_less'}} hide replies show replies
Cabo Cabo February 10, 2024 February 10, 2024 at 3:36:38 PM UTC link Permalink

I know what the search query can do.
And you said false information.
"...limit results to a single user."
as you said, to a SINGLE USER
"That allows you to only see sentences that belong to your friends. "
That doesn't allow me to see ONLY THOSE sentences.

And who talked about a single user???
I talked about a whole group of users.

And what more, ignoring something what made the corpus less balanced, as Ibdx said: https://tatoeba.org/en/wall/sho...#message_40485
"The years 2017 and 2018 were years in which Tatoeba's main English-speaking contributor added hundreds of thousands of sentences in bulk.These sentences were mostly built according to syntactic patterns and used wildcards to avoid creating paraphrases that differ only in their named entities. These massive additions have greatly reduced the lexical diversity of the English corpus and increased the proportion of sentences containing pervasive words from 20% to 40%. This sudden change coincides with a sharp drop in the number of active contributors to Tatoeba."
... doesn't help.

{{vm.hiddenReplies[40501] ? 'expand_more' : 'expand_less'}} hide replies show replies
Cangarejo Cangarejo February 10, 2024, edited February 10, 2024 February 10, 2024 at 3:45:38 PM UTC, edited February 10, 2024 at 10:20:32 PM UTC link Permalink

You can do multiple searches if you want multiple users; and Tatoeba could improve the search engine in the future to allow restricting results to given groups of users or to allow excluding groups of users. Don’t you usually translate Pfirsichbaeumchen’s sentences?

I think any imbalance in the corpus can be fixed with improved search features and using lists. There’s room for everyone.

{{vm.hiddenReplies[40502] ? 'expand_more' : 'expand_less'}} hide replies show replies
sundown sundown February 11, 2024 February 11, 2024 at 8:06:36 AM UTC link Permalink

> I think any imbalance in the corpus can be fixed with improved search features and using lists. There’s room for everyone.

The idea that improved search will sort out the imbalance in the corpus I find overoptimistic, to say the least.

{{vm.hiddenReplies[40507] ? 'expand_more' : 'expand_less'}} hide replies show replies
Cangarejo Cangarejo February 12, 2024, edited February 12, 2024 February 12, 2024 at 8:58:07 AM UTC, edited February 12, 2024 at 3:48:41 PM UTC link Permalink

What about Ibdx’s solution, or hand-crafted lists?

{{vm.hiddenReplies[40516] ? 'expand_more' : 'expand_less'}} hide replies show replies
sundown sundown February 14, 2024 February 14, 2024 at 8:29:23 AM UTC link Permalink

I agree with lbdx that a sentence limit should be introduced. It's the least that should be done.

Cabo Cabo February 11, 2024 February 11, 2024 at 1:26:35 PM UTC link Permalink

If you don't want to see it, it's still there.
Well, just wear blinders then.

"Don’t you usually translate Pfirsichbaeumchen’s sentences?"
Huh? So you think I'm maaster now. I'm Cabo, it's written on the top of the message block. And what if someone has a dedicated user whose sentences he/she likes to translate? Does he/she have no right for opinion? Just translate and care for nothing else?

And thank for the one who blocked my message?
Was it the word shite? Or just not happy what I had to say?

{{vm.hiddenReplies[40509] ? 'expand_more' : 'expand_less'}} hide replies show replies
Cangarejo Cangarejo February 12, 2024, edited February 12, 2024 February 12, 2024 at 8:54:46 AM UTC, edited February 12, 2024 at 2:52:02 PM UTC link Permalink

> So you think I'm maaster now?

I did confuse you with maaster.

> And what if someone has a dedicated user whose sentences he/she likes to translate?

I wasn’t criticizing maaster. I was just saying that you can already restrict the sentences you see to those belonging to any one user you want.

February 12, 2024, edited February 12, 2024 February 12, 2024 at 8:49:48 AM UTC, edited February 12, 2024 at 8:50:05 AM UTC link Permalink
warning

The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.

sharptoothed sharptoothed February 11, 2024 February 11, 2024 at 4:38:40 PM UTC link Permalink

✹✹ Stats & Graphs ✹✹

Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/

CK CK February 10, 2024 February 10, 2024 at 9:44:39 PM UTC link Permalink

🍎 Stats : An attempt at counting the number of active contributing usernames per year

2024 (268) * in progress
2023 (1165)
2022 (1225)
2021 (1366)
2020 (1520)
2019 (1304)
2018 (1465)
2017 (1414) * the year the SSD died. https://blog.tatoeba.org/2017/
2016 (1899)
2015 (2248)
2014 (2040)
2013 (2289) * the peak
2012 (2179)
2011 (2032)
2010 (1512)
2009 (155)
2008 (85)
2007 (20)

Here are the number of usernames owning sentences without valid dates (early entries in the database).
0000 (1515)

** Notes

This is based on data harvested from th 2024-02-10 sentences_detailed.csv file.

Note that this is not actually the number of active usernames from each year.

The usernames counted are the ones who currently "own" sentences added in those years, not really the contributors, since "orphan" sentences can be adopted. This means that the year the Tanaka Corpus sentences were imported shows a lot more contributors than there actually were. Many of those who have adopted these sentences joined the project much later.

{{vm.hiddenReplies[40506] ? 'expand_more' : 'expand_less'}} hide replies show replies
Yorwba Yorwba February 11, 2024 February 11, 2024 at 4:37:21 PM UTC link Permalink

If you want to count contributors active in a given year, you should probably analyze the contributions.csv file instead.

February 7, 2024 February 7, 2024 at 11:30:18 AM UTC link Permalink
warning

The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.

Igider Igider February 5, 2024, edited February 9, 2024 February 5, 2024 at 11:22:47 PM UTC, edited February 9, 2024 at 5:59:23 PM UTC link Permalink

You've raised a very good question. By the way, the Berber language as defined in Tatoeba is a catch-all, because 80% of its phrases are Kabyle phrases but with a mixture of other Berber words. This mixture is not based on any linguistic reality, only ideology. The "Kabyle Berberists" who created this ideology think that by imposing their Kabyle language and mixing it with 20% of other languages, they will be able to unite all Berbers.

By the way, everything you can find in the way of novels, poetry, theatre, websites, music... It's 70% Kabyle. Because the Kabyle language has been transcribed into Latin since the 18th century. And there are 12 million Kabyles. Even the flag that Tatoeba's admins have imposed on the Kabyle language is not the right one, but rather the flag of all Berbers. The Kabyle flag has been removed from Tatoeba, since that. So the ideology feeds on another ideology. But the Kabyle language will progress, that's for sure. You only have to look at the digital fields in which the Kabyle language is used for localisation, learning and so on.

There are several sites devoted to the Kabyle language, starting with Wikipedia.

https://kab.m.wikipedia.org/wiki/Asebter_agejdan

sharptoothed sharptoothed January 28, 2024 January 28, 2024 at 8:23:31 AM UTC link Permalink

✹✹ Stats & Graphs ✹✹

Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/

CK CK January 27, 2024 January 27, 2024 at 8:49:45 AM UTC link Permalink

🍎 Tatoeba.org Native Speakers with Native Language Sentences

http://tatoeba.ueuo.com/stats-2024-01-27.html

Find native speakers of languages you are studying and get links to their native language sentences.

The link above is to a page that shows only the 3,999 contibutors with 20 or more native-speaker sentences.

These members have contributed 99.7% of the native-speaker sentences.

If you want to see all such contributors, try the following link.

There is a lot of data, so the page will be slower to load, and may possibly not work on some devices.

http://tatoeba.ueuo.com/stats-2024-01-27all.html

Updated: 2024-01-27

eeyinn eeyinn January 17, 2024 January 17, 2024 at 1:30:44 AM UTC link Permalink

Hello,
I would like to offer translations in 2 cousin languages i am studying, of which i seem to be the only speaker or student on Tatoeba (L2) and whose way of constructing sentences is so different from other language corpuses I've contributed to on Tatoeba that I'm not sure how to proceed.
In Muscogee and in Hitchiti, a sentence like "Tom had an idea" isn't a sentence. Sentences start and end with entire "scenes" like in a movie. In a paragraph-length text, you don't have many sentences back to back, each conjugated for person and tense and such. They are meant to start and end with the "scene" you are you describing, composed of many short thoughts stitched together, with barely conjugated verbs throughout a typically very long sentence, and fully conjugated verbs only appear at the very end of the sentence where the "scene" you're narrating finally concludes a paragraph or two later. So when I get a sentence like "Tom had a great idea", that is not a whole scene and so It doesn't actually feel right to conjugate it as, let's say "Tom-ke vkerrickv herēmahēn hayvtēs." (tom mad a very good idea, conjugated to be very long ago). But, if conjugate it as the "clause/fragment" that Muscogee would treat it as, it wouldn't be a complete sentence, and so it wouldn't really include the period at the end of the sentence, which are always at the end of Tatoeba sentences as a matter of policy.

So I guess, what I'm asking is, what should I do? Should I conjugate these short sentences as all occurring in the recent past and constituting a full story, against the language's actual syntax, or should I write them as the clauses that the language would make them into being in a fuller-fledged sentence, but not actually provide "sentence-bearing complete sentences" as Tatoeba makes all of its entries?

Thankful for the Tatoeba project. 🙏

{{vm.hiddenReplies[40471] ? 'expand_more' : 'expand_less'}} hide replies show replies
Objectivesea Objectivesea January 17, 2024 January 17, 2024 at 3:53:06 AM UTC link Permalink

By its nature, Tatoeba leans towards short, complete utterances, even though such short sentences are obviously removed, in most cases, from a larger context. It is possible to construct scenes by assembling Tatoeba sentences into a defined order, and there are sometimes paragraph-long Tatoeba entries consisting of ten or more sentences, but these are frowned on as being unlikely to ever be tackled for translation. Also in such cases, we can run into a limit on the number of characters to be represented, and this can make a Tatoeba translation into certain languages impossible.

As a practical matter, I like to think of Tatoeba as a tool for the language learner and not as a comprehensive multilingual encyclopædia of all possible utterances. For me, in such a tool, sentences should be as short as possible while still conveying some meaning.

For me, a minimal sentence has a noun and a verb. Then there are transitive sentences with a direct and/or indirect object. A few well-chosen adjectives and adverbs add some spice, and then coordinate and subordinate conjunctions link related thoughts together. This works well for most European languages. Different structures are likely necessary for agglutinative languages, and I understand that North American Indigenous languages make use of a fourth-person pronoun on occasion.

Your contributions in Muscogee and Hitchiti will be valuable to Tatoeba, as I think we are unlikely to find a native speaker, which would have been ideal. I would suggest that you begin with the simplest of sentences which can be unambiguously represented in those languages and which are grammatically correct, even though they may convey only a little of the flavour of an Indigenous Knowledge Keeper , Elder or Matriarch relating the oral history from his or her ancestors.

Thanuir Thanuir January 17, 2024, edited January 17, 2024 January 17, 2024 at 7:26:45 AM UTC, edited January 17, 2024 at 7:29:43 AM UTC link Permalink

With Kven, which I read but can not use fluently, I add simple sentences I am sufficiently certain of and then translate to Finnish, rather than trying to translate from other languages to Kven. Maybe this approach works for you, too - try to put in example sentences you have strong reasons to believe are correct, and then translate those to your native language(s).

As a bonus, you are likely to increase the diversity of the corpus by getting culture-specific sentences and not getting Tom everywhere.

brauchinet brauchinet January 19, 2024, edited January 19, 2024 January 19, 2024 at 9:53:33 AM UTC, edited January 19, 2024 at 9:56:32 AM UTC link Permalink

Very interesting.
I think you are the best person to decide how to do it, and there isn’t much to be done wrong.
Tatoeba states that sentences should be natural sounding. If you feel that a "sentence" would be natural sounding within a specific context, it might be okay to add it that way even when the verb isn’t conjugated und the context isn’t known. If it sounds like „…have an idea and …“ and people would react „huh – and what?“, it would not be a suitable sentence.
The definition of sentence will vary between languages. If grammar requires that you always conjugate verbs, utterances without such a verb might not be considered full sentences. I unconjugated verbs are just normal, such a definition doesn’t make much sense. It’s true that Tatoeba sentences require a full stop, mostly because most languages require a full stop at the end of sentences. So for the sake of consistency and to avoid comments and questions, couldn’t you simply add full stops? I wonder if there is really a Muscogee rule saying you mustn’t use a full stop.

Sometimes one has to “make up” some extra information in the target language that is in the source language. For example, I heard some languages require that speakers choose between different verb forms indicating the degree of certainty. Since this information isn’t in an English sentence you need to decide yourself when translating.
So in your case, you could also add more than one possible translation and annotate it in the comment section, one as whole scene in the distant past and one as part of an (imaginary) scene.

jlake jlake January 17, 2024 January 17, 2024 at 3:25:09 AM UTC link Permalink

old style was way better , i mean vertically