[Postgres-xl-bugs] assert failed on coord when commit

Jov amutu at amutu.com
Thu Jan 1 04:31:17 PST 2015


Thanks, Pavan!
This happened only once on my FreeBSD host, and I can't reproduce it now.
By the way, this is pgxc git head, which is based on 9.4beta, not pgxl.
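
For anyone reading the trace below: as far as I understand lock.c, the failed assertion guards a shared per-partition counter of strong relation locks, which a backend decrements when it drops a local lock that was flagged as having bumped that counter. Here is a minimal sketch of that invariant. All names and sizes are hypothetical stand-ins, not the actual PostgreSQL code, and the real code does this under a spinlock, which the sketch leaves out:

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical partition count; the real value lives in the PostgreSQL sources. */
#define STRONG_LOCK_PARTITIONS 1024

/* Stand-in for the shared array of strong-lock counters. */
typedef struct {
    uint32_t count[STRONG_LOCK_PARTITIONS];
} StrongLockCounts;

/* Stand-in for a backend-local lock entry. */
typedef struct {
    uint32_t hashcode;
    bool     holds_strong_lock_count; /* did this entry bump the counter? */
} LocalLockModel;

static StrongLockCounts strong_counts;

/* Map a lock hash code to its counter slot. */
static uint32_t partition_of(uint32_t hashcode)
{
    return hashcode % STRONG_LOCK_PARTITIONS;
}

/* Taking a strong lock bumps the counter and remembers that it did. */
static void acquire_strong(LocalLockModel *lock)
{
    strong_counts.count[partition_of(lock->hashcode)]++;
    lock->holds_strong_lock_count = true;
}

/*
 * Dropping the local lock undoes the bump. Returns false in the case
 * where an assertion like the one in the trace would fire: the entry
 * claims to hold a count, but the shared counter is already zero.
 */
static bool release_strong(LocalLockModel *lock)
{
    uint32_t slot = partition_of(lock->hashcode);

    if (!lock->holds_strong_lock_count)
        return true;                  /* nothing to undo */
    if (strong_counts.count[slot] == 0)
        return false;                 /* flag and counter disagree */
    strong_counts.count[slot]--;
    lock->holds_strong_lock_count = false;
    return true;
}
```

In this model the failure can only happen when the flag and the counter get out of step, for example when some other path already decremented the counter but left the flag set.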


Jov
blog: http://amutu.com/blog

2015-01-01 18:35 GMT+08:00 Pavan Deolasee <pavan.deolasee at gmail.com>:

> On Thu, Jan 1, 2015 at 3:57 PM, Pavan Deolasee <pavan.deolasee at gmail.com>
> wrote:
>
>> Hi Jov,
>>
>> Does this happen every time you run make check on FreeBSD? Also, does
>> this happen on Ubuntu or any other Linux platform? We haven't changed that
>> part of the code a lot so I am wondering if this could be some PG bug.
>>
>
>
> FWIW I've found two PG bugs which seem to be similar to this crash. We
> haven't yet merged those 9.2 minor releases into XL, so maybe this will get
> fixed when we do that.
>
> http://www.postgresql.org/message-id/6321.1406003087@sss.pgh.pa.us
> http://www.postgresql.org/message-id/26964.1396805001@sss.pgh.pa.us
>
> Thanks,
> Pavan
>
>
>>
>> Thanks,
>> Pavan
>>
>> On Wed, Dec 24, 2014 at 8:29 AM, Jov <amutu at amutu.com> wrote:
>>
>>> This happened when I ran make check on FreeBSD. The regression run hangs here:
>>> parallel group (10 tests):  create_cast create_aggregate typed_table
>>> create_function_3 drop_if_exists create_table_like constraints
>>> updatable_views vacuum triggers
>>>      create_aggregate         ... ok
>>>      create_function_3        ... ok
>>>      create_cast              ... ok
>>>      constraints              ... ok
>>>      triggers                 ... ok
>>>      create_table_like        ... ok
>>>      typed_table              ... ok
>>>      vacuum                   ... ok
>>>      drop_if_exists           ... ok
>>>      updatable_views          ... ok
>>> test sanity_check             ... ok
>>> test errors                   ... ok
>>> test select                   ... ok
>>> parallel group (19 tests):  namespace select_distinct select_distinct_on
>>> random update select_having delete case select_implicit btree_index
>>> hash_index select_into union arrays subselect join portals aggregates
>>>
>>> the aggregates.out file:
>>> -- check some implicit casting cases, as per bug #5564
>>> select string_agg(distinct f1, ',' order by f1) from varchar_tbl;  -- ok
>>>  string_agg
>>> ------------
>>>  a,ab,abcd
>>> (1 row)
>>>
>>> select string_agg(distinct f1::text, ',' order by f1) from varchar_tbl;  -- not ok
>>> ERROR:  in an aggregate with DISTINCT, ORDER BY expressions must appear in argument list
>>> LINE 1: select string_agg(distinct f1::text, ',' order by f1) from v...
>>>                                                           ^
>>> select string_agg(distinct f1, ',' order by f1::text) from varchar_tbl;  -- not ok
>>> ERROR:  in an aggregate with DISTINCT, ORDER BY expressions must appear in argument list
>>> LINE 1: select string_agg(distinct f1, ',' order by f1::text) from v...
>>>                                                     ^
>>> select string_agg(distinct f1::text, ',' order by f1::text) from varchar_tbl;  -- ok
>>>  string_agg
>>> ------------
>>>  a,ab,abcd
>>> (1 row)
>>>
>>> -- string_agg bytea tests
>>> create table bytea_test_table(v bytea);
>>> ERROR:  Failed to get pooled connections
>>> CONTEXT:  SQL statement "EXECUTE DIRECT ON (coord2) 'SELECT pg_catalog.pg_try_advisory_xact_lock_shared(65535, 0)'"
>>> select string_agg(v, '' order by v) from bytea_test_table;
>>> ERROR:  relation "bytea_test_table" does not exist
>>> LINE 1: select string_agg(v, '' order by v) from bytea_test_table;
>>> ---------------------------------------------
>>>
>>> The coord2 node has stopped, and this is the coord2 log:
>>>
>>> STATEMENT:  alter table emp rename column salary to manager;
>>> ERROR:  column name "oid" conflicts with a system column name
>>> STATEMENT:  alter table emp rename column salary to oid;
>>> TRAP: FailedAssertion("!(FastPathStrongRelationLocks->count[fasthashcode] > 0)", File: "lock.c", Line: 1278)
>>> LOG:  server process (PID 44197) was terminated by signal 6: Abort trap
>>> DETAIL:  Failed process was running: COMMIT TRANSACTION
>>> LOG:  terminating any other active server processes
>>> WARNING:  terminating connection because of crash of another server process
>>> DETAIL:  The postmaster has commanded this server process to roll back the current transaction and exit, because another server process exited abnormally and possibly corrupted shared memory.
>>>
>>> this is the coord_1 log:
>>>
>>>
>>> #0  0x00000008017466ca in thr_kill () from /lib/libc.so.7
>>> [New Thread 802006400 (LWP 101401/postgres)]
>>> (gdb) bt
>>> #0  0x00000008017466ca in thr_kill () from /lib/libc.so.7
>>> #1  0x000000080181b149 in abort () from /lib/libc.so.7
>>> #2  0x00000000009d9c49 in ExceptionalCondition (conditionName=0xb91937
>>> "!(FastPathStrongRelationLocks->count[fasthashcode] > 0)",
>>>     errorType=0xa5efed "FailedAssertion", fileName=0xb92449 "lock.c",
>>> lineNumber=1278) at assert.c:54
>>> #3  0x000000000084d905 in RemoveLocalLock (locallock=0x802291648) at
>>> lock.c:1278
>>> #4  0x000000000085023c in LockReleaseAll (lockmethodid=1, allLocks=0
>>> '\0') at lock.c:2119
>>> #5  0x0000000000856a77 in ProcReleaseLocks (isCommit=1 '\001') at
>>> proc.c:770
>>> #6  0x0000000000a13173 in ResourceOwnerReleaseInternal
>>> (owner=0x8021d3150, phase=RESOURCE_RELEASE_LOCKS, isCommit=1 '\001',
>>> isTopLevel=1 '\001')
>>>     at resowner.c:286
>>> #7  0x0000000000a12f83 in ResourceOwnerRelease (owner=0x8021d3150,
>>> phase=RESOURCE_RELEASE_LOCKS, isCommit=1 '\001', isTopLevel=1 '\001') at
>>> resowner.c:206
>>> #8  0x0000000000525ec9 in CommitTransaction () at xact.c:2391
>>> #9  0x0000000000525606 in CommitTransactionCommand () at xact.c:3223
>>> #10 0x0000000000873294 in finish_xact_command () at postgres.c:2550
>>> #11 0x00000000008710a5 in exec_simple_query (query_string=0x802098038
>>> "COMMIT TRANSACTION") at postgres.c:1162
>>> #12 0x000000000086fefa in PostgresMain (argc=1, argv=0x802090e68,
>>> dbname=0x802090c28 "regression", username=0x802090c08 "jovz") at
>>> postgres.c:4235
>>> #13 0x00000000007f4746 in BackendRun (port=0x80209a540) at
>>> postmaster.c:4270
>>> #14 0x00000000007f3d97 in BackendStartup (port=0x80209a540) at
>>> postmaster.c:3959
>>> #15 0x00000000007efbc0 in ServerLoop () at postmaster.c:1709
>>> #16 0x00000000007ed2a7 in PostmasterMain (argc=7, argv=0x7fffffffe6b0)
>>> at postmaster.c:1368
>>> #17 0x0000000000729c83 in main (argc=7, argv=0x7fffffffe6b0) at
>>> main.c:196
>>>
>>> Jov
>>> blog: http://amutu.com/blog
>>>
>>>
>>
>>
>> --
>> Pavan Deolasee
>> http://www.linkedin.com/in/pavandeolasee
>>
>
>
>
> --
> Pavan Deolasee
> http://www.linkedin.com/in/pavandeolasee
>